BI48 SEQ ID NO:1

ATCGCC6ATCCAAT6CCAAGGA6TACCTG6GCAATCAGA6CCTACTCACGGCT6CCGGG6CCGGCATTGCCAA6CTCCTG 
GACGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGCACCAGCTTCGGGACGACCGGAACCAACAGCAACTC 
GGCCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCG 
CCGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGAACTCAAGGGCCGC 
AAGGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAACCCCCTTCTTCGTCTCTGCGCCGGCCTGG 
CCTGCGCGCTGACCCTGGCAGCGTGCTCCACCAGCAAGGAGGAGATGCTGCCCCACGGCGAGGCCAACATGCTCGACGTC 
TGGGAGCGAGGTGCGACCAGCTCGATAGGCAACAGCCGTGGCCGGCTGCTCCTCGATGCCAGGCAAACGCTGCGGCGCCC 
AATCGATCCGCAGCAGGATGCCTCCGCGAATGACCAGGrc^^ 

TCAAACGACTGCCCAATCCCGACCTGGTGATGTATGTGTTCCCGCACCTGGCCGGCAGCGATCCCGCCCCGGTACCGGGC 

TA(^CCACCGTGTTCCCCTTCTACCA(X:GAGTCCAGTACGCCATGCCG(MCGAACGCACGGAGGACTATTGAT^ 

TTTCAAACCCTTCTGCGCGGTCGCACACAGCCTCAGTCGGTACCGGCAGACGCTCCCGAAGATTCAGGAGCGCTGGACGT 

AGCGGCCGCGGAAGAAGCGACT(3AGC(K:TATCTG(K:GCGACTGGCCGCCATGGGTATTCCTCT(K:CCAACACCGG(^(^ 

AGAATGGCGCCACGCAGGCCGAAGCGTCACGCCTCTACGATCACGACCCATCGTTCGTAGACCTGCTGCCCTGGGCTGAG 

TACCTGCCCGACGAGCAAGTGATGCTCCTGGAGGATGGGCGTTCGCGCGCCGCATTCTTCGAACTGGTGCCCTTGGGCAC 

CGAGGGCCGCGATCCCAATTGGATGCAGAACGCCCGGGACGCATTGAAAGAAGCCCTGCAGAACTCCTTCGACGAGCACG 

AAACCTCACCCTGGATTGTCCAGTTCTACGCCCAGGACGAGATCAGCTGGGACAATTTCCAGGAGCAGTTGAGGCAGTAC 

GTCCATCCTCGAGCGC(3AG(&TCGGCCTTCA(m<M^ 

GCCGGGCGGACTGTTCGTCGACACCGCCGTCAGCAAGCTGCCCTGGCGAGGACAACAGCGCCGCGTGCGGATGGTCGTCT 
ACCGCCGGATCCGCAAGGAGGATGCGCAGATTCGCGGACAGGACCCGGCGGCGTACCTGAAATCCATCTGCGAGCGTATC 
CAAGGCGGCCTGGCGAACGCCGGCATCGTCGCTTCGCGCATGGGCGGACAGGAGATCAGGAACTGGTTGATCCGCTGGTT 
CAACCCGCACCCGGATCACCTCGGCCAGGCCGAGGCGGACCTACGTCGCTTCTACGAACTGGTATGCCGTCCGGACGAAC 
CGATCCTGCAGGATGAATTGCCACTGGCCGACGGCACTGACTTCTCCCAGAACCTGTTCTATCGGCAGCCTGTTTCCGAT 
GCCACCCAGGGCGTATGGCTCTTCGATGCCATGCCGCACCGAGTGATTGTGGTCGACCAGTTGAACAAAGCGCCGCTGAC 
AGGCCACTTCACCGGCGAGACGCTCAAAGGCGATGGCCTCAACGCCCTGTTCGATCGAATGCCCGAGGACACGCTGCTGT 
GCATCACCATGGTCGTGACGCCGCAGGA(aTGCT^^ 

CAGGCCTCGATCCACACCCGCGAGGACGTGGCCACCGTTCGACGCCTGATCGGCCGGGAGCACAAGCTCTATCGCGGAGC 
GATCGCTCTGTTCGTGCGCGGCCGCGACCATACCCAGTTGGAGGAACGCTGCATCACCCTGAGCAACGTACTGCTCGGCG 
CCGGCCTGGTGCCGGTCGAACCGCAGAACGAAGTCGGACCGCTGAACAGCTACCTGCGCTGGCTCCCCTCAAACTTCGAT 
CCAAACGAGAAGCGAGCCCTGGAGTGGTACACCCAGATGATGTTCGCTCAGCACATCGCCAACCTGTCGCCCATCTGGGG 
GCGCACCACCGGTACCGGACACCCTGGCTTCACGCTGTTCAACCGTGGCGGCGCGCCGTTGACCTTCGACCCGTTCAACA 
AGCTGGACCGGCAGATGAATGCCCACGGCTTCATCTTCGGGCCAACTGGCTCCGGCAAGTCGGCGTCCCTGACCAACCTC 
ATCTGCCAGATGCTCGCCATGTACCTGCCGCGGATGTTCGTCGCGGAAGCGGGCAACAGCTTCGGCCTGCTGGCCGACTT 
AGCCAAGCGGTTTGGCCTCTCGGTCCACCGGGTGCGCCTCGCCCCGGGCTCCGGCGTCAGCCTGGCGCCGTTCGCGGACG 
CCATCAAGCTGGTCGAGAGCCCCGACCAAGTGAAGGTGCTGGACGCCGAAGACATCGAGGCCTCGGACTCGGTCCAGGGC 
AGCAAGGCCGACCTCGAGGACGACCAGCGAGACATCCTGGGCGAGATGGAGATCGTCGCCCGCCTCATGATTACCGGTGG 
CGAAGAGAAGGAAGATGCGCGCCTGACCCGTGCCGATCGCAGCGCCGTCCGCCAGGCGATCCTGGCGGCGGCCAGGACCT 
GCGCCGCCGCGAACCGCACGGTACTGACCCAAGACGTGCGCGATGCGCTCTACGAGGCCTCCAGGAGCGATAGCACCGCG 
CCAGAACGCCGCGCGCGGATCGCCGAAATGGCGGAAGCCATGCAGATGTTCTGCATGGGCGCCGACGGCGAGATGTTCAA 
TCGCGAAGGCACGCCCTGGCCTGAGGCCGACCTTACCGTGGTGGATTTCGCAACGTACGCGCGCGAAGGCTACGCCGCCC 
AGCTCGGGATCGCCTACATCTCGCTGCTGAACACCGTGAACAACATCGCCGAACGCGACCAGTTCAAGGGCCGGCCAATC 
GTCAAGATCACCGATGAGGGGCACATCATCACCAAGCACCCGCTGCTGCTGCCCTACGCCATGAAGATCACCAAGATGTG 
GCGGAAACTGGGCGCCTGGTTCTGGCTCGCCACCCAGAACATCGACGACATCCCAGCCTCCGGGGCGCCGATGCTGAACA 
TGATCGAGTGGTGGTTGTGCCTGAACATGCCCCCCGACGAAGTAGAGAAGATATCCAGGTTCCGCGAGCTGTCGCCGGCG 
CAGAAGTCGATGATGCTCTCGGCCCGCAAGGAAAGCGGCAAGTTCACCGAGGGCGTGCTCCTGGCCAAGGGCAAAGAATA 
CCTCGTCCGTGTGGTTCCCCCGAGTCTCTACCTGGCCCTGGCCATGACCGAAAACGAAGAAAAGAACCAGCGCTACAACA 
TCATGCAAGCCACCGGCTGCGACGAGCTCGAGGCGGCCTTGCAGGTCGCAGCGGATCTCGACAAGGCGCGCGGCCTGCCA 
CCCTTCCCCATTGTTTTCCCAGACCAACCGGCAGTGGAGTGCCAGGACGAATGAGAGTTCTGAATTCGCTGACCCAGAAC 
CTGATCGACAACCTGACCCAGATCCTGCAGAACCCCGAAGAGGATGCCCTGCAGACGCTAAGGATATGCGCTCCTGTACT 
GATAGAGGAGCTGCAGCAGATTCAACTGAGGGCAGTCGATCGCCGGGATATCGTCCCGCAGATAAAGCAGCTCTTGGATG 
AATGGCTGCAACAACATCCACAGCCTGATACGGCCCAACAGGCGCTCATTGAGGCCGTGGACCGCGCGGAGATCCTACAG 
CGGAGGCAAGCGTGAGACTCTTGAAGGGCGGCTGGGCAGCCAAACGATTTCAAGGTCCCGCCCTGCCCTGGGCGGGGCTG 
CTCCTGGTCTTGCTG(Om

CGATGC6AAA6CGCGCTGGACGATCAAT6AATAC6CCGACCTGGAGTGCCCCTTCT6CAAGGTCTACACCCC6CG6CTTA 
AGCGCTGGGTAGACAGCCATCCGGACGTGAACCTGGTTTGGCGCCATCTTCCCCTGCAGATGCATGGCGAGGCGGCCCGC 
CACCAGGCTCGCCTGGTGGAGTGCGCGGGGATCCAAGGCGGCGCCAAAGCCTTCTGGAGCGCTATCGATGCGATCTTCGC 
TCAGTCGGCCGGCAACGGGGGCGGGCTGCCTGGCGGCACATTGGACTTTCCTGAACTGGACCAGGCTCGACTGGAGAAAT 
GTGCGAAAGACAACGAACTTATTGACTCAGATATCAAGTTGGACATCGACATTGCACGGTCGAAGGGCATTACAGCGACC 
CCGACCCTCGTCATCC(mCAACCAGACG(&ACGAA(m 

GGCGATAGACTGGCTAGCCAAGGATCTCTAGCGTCGCGCC^GAGACTCTTGGCTAAGAAATCGGCGAGGATTCCAACAT 
CCCTCTTTTGGTCCTCCAGGATGCCCTGCACTTCACCTGGCAGAACCTCGACCTCCTCCCCATCCACAATCTTTACCATT 
CTCTTGTGGCCGGAGCTGGTGAGGCTAAGCCTCAACTCCATTGCCGGCCGAGCATTGATGTAAATGCTCTCGAGCAAGCG 
CTCCATGACTTCGACCACTCCTTAATATCAGTTAGCCAGCTACATACAGGAATTATGCTACCCAGGACATGCAGGCGTCA 
CCCCTACTTATGTACGTGGCAGCGTTCGATCACGGCTCGAAAAAATACACCACCTACGAGTTGATGATTTCCTGCTGCCC 
TATCGGAAGTGATCCTGTCTGCTGTCTGTACCTTTCTAGAACCGGTACAGACCCATGCCTCTTCATCACTCCCCCCCTGG 
CCGGCGGCCACCAACGCTGGCCGTTGGCGTACTACTGGTACTGCTGAGCAGCGCGAGTCAGGCCGAAACCTGGGTCATCA 
CCGACAAGGCTCATCCGGTCTCTGCCACCGGATCGTCGCGCGTTCTGTTTCTGGACGCCCAGGAACACCTCGAGGAGCAA 
CTGACTGCGGCCTTGCCCCAGGATCCACAGCATGCT(^AGCGGCGTTTAAGCGATTGCTACAAAGCCCCGATGGGCGCCG 
CCTGCAG(£AGAGCTGGTCAAG(3CACAACAAGACGTC(m^ 

TAGTCGATAGGCAGTACGTGGTCTACGGCGAACCGGATGTTTCGCGCGCTCTTGAGCTAATCGCCAAGGCCAGGAGGTCG 
CGCTGATGACCAGCCTCAACCTCCGCCGCCTGGCAGCGGCGGCCGCCACCTTCAGCCTCTCGTTCACGGCCTCGGCCGCG 
ATCAACAGCGCTGCCATCGTCTCCTCCACCCTTTCCCCTCAGTGCCTCGAATACAAGGTCGTCGGGATCTGTTACTGGCT 
GCTCTGCGGCCCGCATGGCTGCAAAGTGAAGACGTCGGTCAAGGTCCGCCACTACGTGCCTGACGCAGTCGTCTCCAGCT 
ACGCGAATACCGGGAGCAACCCCTGGACCGAGGTATCGGCGCTGGGTACACCGAATCCACTCGCCCAGGCCGGCAATGAC 
GCGACCACAAACTACAAGGCCGAGAACAGCATCGGCCGCTTCAAGGAAGCGGATGTGATCGGCCATCCTGGTGGCGCCAC 
GTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCGCTGGTGCCGTACTTTCTCAGCACAC 
TGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTTGGTCCCAGGGCTGCGCGAGGTGGGT 
GGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGCACCAGACCGACGACTACAAGACGGC 
AGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCACGTCTACCTCCCCATGCGCGCAGCCC 
CCAAGGACGGCTACTGGCCGGCGGGCGAGCTGAAAGAGGGCGATGCCTCGACCGGGAAATGGCAGGAGCTGACCCCATCC 
CTGAGCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCGACGGGGAGCACGCCTGGGCGCTCTG 
GCGTCCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGACTTCCAATAAGGACACGGAGACGAAT 
CATGCGAATGAACATCACCTCGGTCGCGCTAATGTGGCTGCTCGCAGCGCAACTTGCCCAGGCCGACGACCCGATCAACG 
TGTCCAAGACCGGCACGGTGCTCAGCGACGAGGTCCTCTACAGCATTGGCGGCGGCAGTGCGGTGAGCATGGGCAGCGCC 
GGCCAGATGGACTCGATCGGCGTCGGCTTCGGCTGGAACAACGACATGATGTGCGGAAACATGAACCTGAGCACCACCCT 
GGAGAACCAGCTCAACGGTGCCACACAGGGTTTCCAGAACATCATGGGCTCAGTCATCCAGAACGCGACCGGCGCGGTCA 
TGTCGCTGCCGGCGTTGATCATCCAGCGCGCGAACCCTCAGCTCTACAACCTGATCACCAATGGCATCCTGCAGGCGCGG 
ATCGACTACGACCGCTCGAMGGGACTTGCAAMCGATCGCCGAAAAGATGGCTGAC^TCGCTGGCGAGCAGACCGGCTG 
GGGGAAAATCGCCGAAGGCCAAGCCCTGGGCGCCACACTGGCCTCTGACGGGAAAGACGCCGTATCCGCCCTCGAAGCAG 
TGGAGMGAMGGCGGCAACGATGGCGTAACCTGGGTTGGTGGAGACAAGGCCGGCGGCTCCGGCCAGAAGCCCATTCGC 
ATCGTCAACGACGTGACCCGGGCGGGCTACAACCTGTTGACCAGCCGCTCGGTGAATGATTCGTCGAGCGTGCCTTCCGC 
CACTTGCAACMCGGCCTGGTCTGCAACACTTGGTCCTCCCCCCAGGAGGCCGCCGCATTCGCCACCCGGGTACTGGGGG 
AGCAACAGCAACAGACCTGCGAAGGCTGCCAGAAGACGGTGACGGCTGCTGGCGTCGGCCTCACCCCGCTGATCCAGGAG 
ACCTACGACAAGAA(£TC(aGTCGCTGCAG(3AOT 

CGGCACCGATGCTCTGCCAATTACCCGCGGCGTCATCGAGGCGCTGCGCGACGAGCGTGACCAGGACGTCCTGGCGCGCC 
GCCTGGCGTCCGATGTCTCCCTGATGGACGTGCTCAGCAAGGCACTGCTACTGCAGCGCCTGATGTTCGCCGGCGCCAAG 
GAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTCCTGCAGCAGGAGATCTCCAA 
TCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCATCGAGCGCGGGCAACAACGCG 
CCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGCAGGCCCCCTCTGCCGCCGGC 
GGCAAGTCGGGAGGGAGACCGTGATGGCAGATACGCTCACCACCCGAAAGCTTCTCGGTCAGCTACTGGTCGGAGTGCTG 
ATCGTCATCGGACTGGCAGTGGTCGGTACGCTGCTCAGTCTCTTCGCCCTGAACCACTTCGGTGGCATCCAGGGCCTGGA 
GGCCTGGCGGCAAAGCAACTACTGGAGCTTGTTCGCCTGGCGGGCGCTGCTGTACTGCGCCCTGGCCATCGCCTGGTTCC 
GGCAGCGCAAGGAACTGAGCGCGCATGAGCGGCAGCGCATTCGGCGGATCGAGATCCTGGTGCTGTTGCTGGTCCTGCTC 
ATCGAATTCA6CAAAGCCTACTTCCGCACGGGA6GCGCAGCATGACCTTCAT6ACCAATGACTACCTG6AGTATTACCTC
ACCCTCCTCGGCTGGATCATCAACAACGGGATCTGGAACATGATCTCGGATACTGGCCTGTTCGCGGTGCCGTTCGCGGC 
CATCGTGATGCGCGAATGGCTGAAAGTTCGTGGGGAAGGCGCCGACGAGGGCAACAAGGGAGTGCTGTCTCTCGCCCGCA 
TCGAGACGCATATCTACGTCGGCTACATCGTGGTCGCCCTGGCGGGGATCCCGGTCGTCAACGTGAGCTTCGACACCATC 
GAGTTCGACCAGACTCGCGCCCAGCAGTGCC^ 

CAGCCTGGCCGGCAAGAGTGCGCAGATGCCGCTCTGGTGGGCGATGATGCACGCCCTGTCCAAGGGCTTCACCAGCGGCG 
CCATC(KGGCCATTCCGTGC(mCGGATCTGCM^ 

CTGGCAC^GAAATCGCTGATTTTTCCAGAGACTGCTACGGGCCTTCCCGTGCGCGGCTGTTCATGCGGCAACCCGACCT 
GGGCTCCGTCGCCGAGGACAACAAGGCGTTGCAAGACCTGAACTGGATCGGCTCCCGATTCTTGTTGAACACCCCGGGGT 
ACTACGACACCGACTACTCGAAGAGTCCCCGTCAGTCGTGGCCCTACAACGCCACCCGCGATGCCGGCCTGCCTCAGGTG 
GGCGGTGGTGGCGGCTACCCAACCTGCAAGCAGTGGTGGGCTGACTCAGGGATCGGCTTGCGTGATCGGATCAAGGACCA 
GGTGGATCCGGACCTGATGACCAGCTTCCTCAAGTGGGCGAAATGGTTGAACCAGGACGAGGTGACCGAGGCTGTCATTC 
GCCAGGTGATCTCACCCTCCAGCCAGGTCAAGGGTAACGTCTACACCGATTACGGCGGGCAGGTGGGCGGCACCGTGTGG 
AACGGCATCGCGAGAACCGCAGGAACCTTCGGCGTTGCGGTGGGCAGCTTGGCATACTTCCCGGCGATGGATATGGTCCG 
CCAGGCACTGCCGATGGTGATGTCGTTCCTGAAGATGGCAATGGTCATCTGCATTCCGATGGTCCTGGTCATCGGCACCT 
ATCAACTGAAAGTTGCCATGACGATGACGGTCGTCTTCTTTGCGATGATGTTCGTCGACTTCTGGTTTCAGTTAGCCAGA 
TATATCGACAGCACGATACTTGATGCTTTCTATGGTTCGGGATCACCACATCTTTCATTCAACCCAGTCATGGGGCTGAA 
TACGGCTACTCAAGATGCGATCTTGAACTTCGTTATGGGTTCTATGTTCATTGTTTTACCACTACTGTGGATGACAGCGA 
TCGGCTGGTCCGGAATTCAAGCAGGGTCTGTTCTGAACGGATTGAGCAGAGGGACTGAAGGAGTTCAAGCCGCCGGCAAG 
GAAGCAGGAAATAGAGTTAAAAACGCAGTTTGAGGAAGTATAAAGCCATTACCGGGCTCTAGTCCCGGTAATGGTGTGCT 
GCCTGATAGTCAAACAGCTCTGCTACAGATTCAGTTTCAGCTTGTGAAGTCATAGTTGAGACATCTCGACTTGGAGACCC 
TTTAGAGACGGAGATGTCCCCTTTAACCAAGGCAAATCCCACCAGAAAGACACACAGTAGAGTCATTGATACCCCAGCAA 
GCCACATAAAATTCTGTGACACTACAAGAGCTACTAACAAGAATAGGCTAACTCGCTTAACCCAACGTAGAGCTGGACGG 
CGATCATGCAGGCAGAAACGCACAAGCATACCCAGACCAAAACCGATCCGGGAGGCAAAGCCTTTGTTGGTGTGCGCGTT 
CATCATCAATCTCCTGGCTCCCAAAGGGAGGCATCCTGCTATCACCTATACGCCGAAAAAGATGATTTGGCAAGCATTAT 
GGCATATTATGCCACTAGCTATCTGCCGACTGGAGTACCTCATGGCAACGCGAAACGTCGTCCTTCCCGATCCGCTGGAG 
CAGGATATCAACGAGCTGGTGGAGACCGGCCGCTATCAGAATCGCAGCGAAGTCATCCGGGCAGGCTTGCGCCTGCTGCT 
GCAACAGGAAGCCCAGATANGCGCCAAGCTCGAAACCCTCCGCAACGCAACATCCAGTGGGCTGATGCAACTGGAGCGCG 
GCGAGTACGACGAGATCACCAGCGACGAACTGGCCCAATACCTCGACGAGCTCGGCAACCAGGCGAGCCACTGAAGCATG 
GCCMGTACCGCATCTCTCATGATGCCCAAGCGGACATCGTCGATATCCTKGCTTCACCCACAAC^CTTCGKGATGC 
CGCGCGCCGACGTTACCAGGCACTCATAGGGGCGGCGCTGGAAGCAGTTGCGACAGACCCACAACAGGTAGGCAGCATCA 
GCCGTGAAGAACTGGGAGCTGGCCTGCGCAGCATCCACCTCGTTTACTGCCACTCGATGCCCAATGTCGGTAAGGTTGTT 
CGGCCCAGGCACTTCGTCTTCTACCGGGTGGCGACAGACCAGGTGCTAGAGGTGGTTCGCGTGCTTCACGACGCCATGGA 
TGTGGATCAACACCTGCCCCAACGATGAGCAACTGTAACAGGAGCAAATGGCCCAAAGGGGAAATGGGCTTGAAGGGCGA 
AGGGAATGGTTCACAAGGGTAAAAGCCCTACCCGAAAGGGCCTGCCGGAAGGCAGAAAGGGCTGCGTTCGCGCCGGCGAA 
CACGAAAAGGCTACGGGTGAACATCCACCAGCAGGTGCCCCATGGTTGATGCGCAATAGCAGTGGCTTGACCACTCGATT 
GCATTCCCCCTGACCAAGCATTTGCATTCGTGCTGACCACCGCTTGCATGCGGCTTTACCAGTCACACCGTAGTCACGAC 
CGGCTGCAGTCGACCCAAAGCGGACCTTTGTGACAGACCGAAATCGGCCAAAAGCAGACGCTCAAAAGCATCTACGAAAG 
CAGGGACAATTCCGGGGGATACTCCTCTACGTACGTTGCTTATATCGCGGAGGAATTGGCTGAACCGCGGTGCGGAGAGA 
TCTCCTCAAACTGATGGGTTGCACGCATATCGAAGCAGATTACATAGGAGGCTTGCGCTGTTCAACAGCTCCTGAGGGGA 
CTTGGGTTGCCCATGGTTTCCACGGCCCAATCGTTGACGTCATTGACGATTCCGCTGGCTTTTTCAGTACGCATCGCTTG 
GCGCTCCATTACCCAGCCCAATGCGGCCTTGCCGTTGACCAAGCGATTCCAAGGACTGCGATCCATGTAGCCAGCCCTCT 
AATGCATGTATGTATAGGTAAGGTCGTCGTTATTTCGGCGTGGATGTGCTGACTGGAGGTTTAGCGTGGTGAGTCAGTAT 
GTCGTGCGCTATATAAGTCTC^TCTCGGCATCCGCACATGGATGCTGTTAGATTCAGCAAGTGGGAAAAAGCGCGAGCGC 
TAAAAGGAAAGCTGCCGGCCGGCGTGCTCAGTAGCAAGCTACCCCGGCAACACTTAGGAAGAGGTGATCATGAAGTTACA 
GGCATATCGGCTGCAGAACTACCGCCGGCTGCGCGATGTTGTCATCGAGCTCGATGACGAAATTTCTATCTTTGTCGGTG 
CCAACAACAGCGGGAAGACATCCGCCGTCCAAGGCCTGTACTCAATGCTTCGCGGCGAAGTGAAGAAGTTCGAGCTCTTT 
GACTTCAGTGCGGCGCTGTGGGCCGAGATCGATGCGGTCGGCAGGACGCCCCCTGGCGATGAGGATGCGCCQ^AAGGTT 
ACCGTCCATACTCTTGGATCTCTGGTTCCGCGTCGGTGAAGACGACCTCGCCACTGCGATGTCGCTGCTGCCGAGCACTG 
AGTGGGACGGCAAGTGCGTCGGGATCCGGGTAGCGTTCGAGCCTCGGGATGCCCACGAGCTCGTCTGGAAGTTCCATGAA 
CTACATGAGAAGGCCAACAACGCAGCTGTCGCGCTTGCGGCCAAGCGCAAGGCCGCCGGGGAGCAAGCTGTGGAGGCGGG 
C6CGGAA6AC6CGGCTGCG6TGGTGGCCGAT6CCGGCGA6TACAAGCCTTGGCCAGAAA6CCTGACGAAGTACCTCACAA

AGGAACTGAGCAAGGAATACACCTTCCGCTACTACGTGCTCGATGAGCGGGCTTTTGTCGGCTATCAGGCAAGGGAGGCC 

GACTACGAGCCGCTACCCCTAGGCAAGGAGCCGGGCGGTGCAGCCATTCTCAAGTCGCTGGTGAGGGTCGACTTCCTGCG 

CGCGCAGCGGCACCTCGATGACCCAGATGCCGGTAGCTCTGATCGCGCAGAGAGCTTGTCGCGGCGTCTGAGCAGGTTCT 

ATCACCGCAACCTGGAGAAGCGTGGCGACGACCATG^^ 

CACCTGMGGAAGTCTTCAATGACACCCTCAC^^^ 

GATTCGGGCGGCCTTGGATCCGACCACTGTCTTGGGGCAAGACGCCAAGGTTCACTACGTGATCCCGGGCGTAGCTTCCG 

CCCMCTGCCA(3ACA(X:TACMTGGCCTGGGGTTCAA(MTCTGGTCTACATGGTGGTT(3AGCTGCTCGACTTGCA^ 

CAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAGCCTGAGGCGCATCTGCACGCGCA 

GATCCAGCAGGTCTTCATCAGGAACGTTTTGCGCCTCCTTGAGGATGCTAACGATCACGCGACTTTGTTCCACACGCAGC 

TCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTCGGTACTTCCGCCGCGTCAACGAC 

CAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCCGACGCTCCAGCGCGCGAATTCCT 

GCAGCGGTATCTGAAGCTGACGCACTGCGATCTCTTTTTTTCCGACGCGGTGATATTGGTGGAAGGCAACGTCGAGCGTC 

TGCTCCTGCCTGCMTGATCGAGTTGGTGGCCAAGCGCCTGCGTTCTTCCGCCCTAACCATCCTTGAAGTCGGTGGTGCG 

TTCGCGCATCGGTTCCAGGAGCTGATCGCCTTCGTTGGGCTCACAACACTGGTCATCACGGATCTGGACAGCGTGACGGT 

CAAGACGGACGCCGAGAAGGCCGCCGCGCAAGGCGCAGGCGCTGAGGGCGCCGTTGACGGAGATGACGAGGACGAGGACG 

ACGACCTGAAGCCCTTCGAGCTTGAAGACGACGACGAAGCAGAACCGAGTGGCAAGAAGAAGTCCAAGAAGCGTGGCAGC 

ACCTGCCATGCACACGTGGAAGGTGCCGTCACGTCCAACCAAACCCTCATCAGCTGGATCCCGAAGAAGCGGTCGATGGC 

AGAGCTCTGGGAAGTCACGGCGGAGCAAAAGACGCTGTCGCTGGCTGAGGATTCCAGCGCTGGGGTTCGGGTAGCTTACC 

AGACCAAGGTTTCGGTGACGGTGGGTGCGACGACATCACAGCTCTGCGGCCGCACACTTGAGGAGGCCTTTGGTCTTGAG 

AACGCGGACTGGTGCCAGGCTGAGGCAAACCGGTCGGTCGGCCTCAAGCTCAAGCGCGCACCGAGCAGCCCTGAAGAGCT 

GGCmGAAGTTACACGATAGGGTC&TCGGCMGMCTTO^ 

CGCTCAATGGCTGGAAGGTTCCCGCGTACATCGCCGAGGGCTTGGCCTGGCTCGAAGCCAAAGTGGCCCACGAGCTTGAG 
GCGGATGCTGCCATCGCCACCGAGGTCGCGACTATTGAGCCGACTACAGCCGATGTTGTCGCTATCATTGTTGACCCGGG 
GCAGACGGCATGAGCAGACGAATTGATAGCCCAGATACCGACGCCGACCGCGAGATCCACGCATGCATTGTAGCGACGCC 
TCCGCAGCCCTTCGTGGTTCGTGCTGGCGCAGGTTCCGGCAAGACCACCTCCCTCATCAAGGCGCTGGACTGGGTGATCT 
CGGAGCACGGCGCCAGCATGCGGGCGAGGAAGCAGATAGTCGCGTGCATCACGTATACCGACCTTGCCACCAATGAAATC 
CTGGCGGACGTCAACGATGACCCGCTGGTTCATGTCTCGACCATCCACAGCTTTTACTGGTCTATTGCAAAGACGTTCCA 
GGCCGACATCAAGGTTTGGCTGCAGAACGACATCCGCAGGCGGATCTCCGAACTTGAAGAAGAGTTCGAGAATTACAGCT 
CGCGTGTCCGGCAGACCACGCGCGACAGGAACAAGGCCGACCAAGAGCGATATGTCCGAAGCCTGGAGGCTGTGGCCGGC 
GTCAGGACGTTCAACTACGGCGTGGGCAGTGACTACGCC^ 

CTTCCTGCTACAAAACCGCCCGCTGTTCCGACGGGTCGTGGCGCTGAGCTACCCGTTCGTGTTTATCGATGAGAGTCAGG 
ACACGTTCCCGGGTGTAGTGAAGTCTTTCAAGGAAGTGGAAGCCCAGATGCAGGGCAAGTTCTGCCTTGGTTTTTTCGGC 
GACCCGATGCAGTCGATCTTCATGAGAGGCGCAGGGGACATCCAGCTTGAGGATCATTGGCGGGCCATCACGAAGCCGGA 
GAACTTTCGCTGCGCCAAGCAGATCCTTGACGTCGCCAATGCCGTGCGCGCGCAGGGCGATGGCATGGAGCAAGTCCGCG 
GGCTGCACGAGAGGGTCGATGGGAACCTCAAGCTGGTGGAGGGGTCGGCCCGGATGTTCGTCTTGCCGAACACGCTGAAC 
CGAACCGAGGCTTTGGCAAGAGTCCGAGCGTGGAGCTCGGCGACGAACAACGACGAGGGTTGGACAACCCCAGACATCGC 
AGTCAAGATTCTTGTCATCGTGCACCGCATGGCCGCAAACCGGCTTGGCTTCGGCGGCATCTACTCGGCGCTGAACGACA 
AGACGTCGGATGCCATGAAGCAAGGGATGCAGGACGGCACCGGTTGGCCCGTTCGACCCTTCCTAAGTTTTGCGCTACCG 
ATCGTTGCAGCTGTGAAGGCCGGCAATGAGTTCGCGGC 

GGCTCTGACCGGCCGACGTGCCGCGGATGTATTGCGAGAGCTGCACGCTGCTGCGTCGAGGCTTGTCGCCATGCTGGACG 
AGGCAGGGACCACCATTGGTGACATAGCTCTCCATCTCTGTGACACGGGTCTTTTTGAGTTCGACGAGCGCTATGCGCGT 
GTTCTTGGGTTTGTCA(MGATATT(K:TGACACCGCTCA(^AGCCCGAGGCTGCTGATGCAGTTCCGGCCGAAGGATTATC 
CTTGGACGCGACAATGGCCAAGTTCTTCAATTGCTCTGCGCAAGAGCTTTGGCCCTATGAACGCTATGTCTCAGAAGGCT 
CCCCCTATGCCACGCAGCACGGCGTGAAGGGAGCGCAGTTCGAACGCGTCATGGTGGTGATGGACGAGGAAGAAAGCGAC 
TACCGAACGTACAACTACGAGCGTGTCTTCGCGAGTGCTGAGGCCCGCGCTGCAGATCGTGCACGAGCACTAGACGGTGA 
TGAAAACACTTGGAGCCGAACGCTGCGACTGCTTTACGTCTGCTGCACTCGTGCCCAGCGGGGGCTGGTACTAGCGTTCT 
TTGTCGCCGACCCTGCGACCACCCTGGAAAACGTCGTGGCGAGCGGGATCTTGCCGCGAAGCGCAGTCTTTACGCAGGAA 
GTGTTAGTTGGATGGCCATAGACGAACGATTGCCTGACCTATTTGAAGCCAGAGCCGAGCGGCAGGCTCGGTAGCTAGCG 
GGTGGTAACTCTGAGGACGTAATCGACTGCAATTGGCCGATTTCTGCCTATGACGAGGGGCAGCTACCGACCCATTGCTA 
tccttc6taa666t66ctac6cacc6ggcac6ct6c66gatcg6ct6aagctaaaaaatc6cacaacagctttgtaaaac
aagatctcggtcgataacatctatgaacggtctgggttggcaatcagtattggctgccagttcttgtgacgatgcatggt 
ttaaatcgactctttgaggaaatgcgtgggaagcctggaaaagtgctgtttcgcctgcaaagamtaattcatgttcatg 
cgattcgttgtcggcagtgcggcgagtcccaaggctggcgaaggttcatgagctctccaacctcagtagttgcgttggtc 
cttagccttttatcaatcgctgccacaamcctgtggagcgattgttcgatgcccagcgagcagagctacamtctccat 
cacgggtggtgattacamgctgcccagcttatgttgaccaataacgggtcaaagcctgcaactttagtttccttcgaaa 
tcacatcgaaagccacgaccaatacgaaaacatggtttttggtaagcaatacggatggcgaaattctggagccaggcaaa 
acttacaamtcagggcctcaaccgatgagtctatcccaaaaattgtcgaagctgagcgtcggacgattttgaagtctca 
gtacgcacttgcagataattgcgaattaaccgctaaatacatagaggccacggggcagaaggttgtgcgtgtgcaaccgt 
tcatgtgcgacacacctcctgaaaagggtggcctgccccctggtaaacctggcatacccatttggtaccttggtcaagaa 
tgatgtttttatgccgccctgggctttgacgccgattaagcaaagctgtgttcgctcatccaatacgtccctcgcccagt 
taaacgactgttatgtatatgggtgctgccgctacgtaataccttggccctacgcatacgaagttaattctgaaagcgtt 
c^tggacaatcttcctcctcggcgtcgactgcagcggtaaggtgatctactttcgaaacactgcaagggtaggtccttt 
tttggcagcgtccatataccgaccgtggtatggctcagatgcgctggtactgcatttcaccaaataaagttgtgctatat 
cgctcacggccggtatttctatcgtccaaggcgacatattgacgatttcaagggttacgtacccgtcgaaccccggctct 
acctgaccgtcattacaggttaccttkacgaacaatcg^^ 

cggtattttgtatcgatgtttactgcaggtgatgacttgctcaccggatttcagcaccaaattttgctgaatttcttttg 

gttccaaaaaamcgtcgacggg(k:tggatgcgacccatatacmtgcttgcagaggc(kk;caaa(»ttttggctcgtaa 

tatgtatgccccaattgaatcttgagagaaaactcgtcaaataaagttttttcacatatagcgcattgcgcaacaagatg 

ttctagatttttgtcaatgacaatcattttggcttaacttcgttcggcgatggctgagcgttttgttgtgactgaggagg 

ctgcgactggtgttccagcacaaccttgtaatagacggcgtacattataagagaagacaccagggaacttccgattaaat 

tca(^gtccacccgacccatatggtgggattgagcgaamccacttcttcaatcgcgcgc(k:cgcgtgatg(^ 

gatatttcactgccacgcttgatggggctttgggtgtgcgtcgattctgtggagaaaaaaatcacagagccaagcttctc 

gcctttggtaaggcagacatttttgttggtggtgttgaatatcctgagcttgagcttgccatcaaatgcaggttcgacct 

tcgccgaagcaaccagcacgccgcgtgagagaaaaagacttcccgtaggtaggactatgccgtaccgattgtgcggcacc 

ttgatttcttcagcggcctccaccacaacagaatcgtgacccctgatcgtcataccatcgacaatgcgccacaggttttt 

gtcgttgccagaatagttatcactccacccttcacccacggacaattcgagggaaaactcttcaattctctcgtcccctg 

catcagtgagtataaggctgttgctggagtacgatgccgcgtcaaaatccgtgtgggatttcgttgtacgccctttgatc 

tgtaacacactcatcagatttgtccttcgtccttgggctgcgagacgcggccaccctgccattgtcctttataccggccg 

atatccccggataccgcctgaaagatgacgtgcgcaaagcgtgcaccaatctgaatttcaaacgcctcgctgtgattgtt 

ggtgagcgcgaacgtcatcggccctacataacctggaggcagcacactggaactgaacgttatcccgcttctgaacagcg 

TGCTrCTCGGAAAMUUUk^ 

GGTTCCATGACGAAGCAGTCATCCGGGTCTGCGAGCACGACCTCGCTGGCAGGGGTGCGTCGCGTAGATTCTCGCAAGCT 

TCCACCCCCTACTGTCAGGCGAGAGAGGCCTGCGAGTCTGAGGTCAAATCCAACGCCTTCCGGGGTGGTCAACTCACGGT 

GGGCAAGGTGCTTGATTAGTTTTCCATCCCGGAC(^GTTTCAGGAGCGAGTGCGGTGAGTAAATCATCTAnTTGCCTCG 

G(^GGGCTCAGTCTATAGCACT(3A(mCCGCCTAOT 

TGCGTGTCCTGCGGGTTTAGTAGCAAGGACAAAACCGTGCGACATGCG^ 

GGCATAGCAAGCCTTCACCGCCACTCTGATTGGGCT^ 

ACCAAAACTGCCAATGGCAGCTAATGGCCGTTCTCTGCCTGTCGCCTCTTTGGCATGACTGGTCAAGTCGGATGCAAACG 
GTGGTCAGCACCAATGCAATTGGGTGGTCATGTGCGATGCAATTACGCAGTTGAGCCTGGCCCAGTTCCTCCCAAGCAAA 
GCATAAGACCAAGATGGCACATTGCCAACAAAATACCCTTCCCCGCTACCGTTGTTTTATCGTTGTTGCCAGCCCTGATC 
TGGCGGAAAAGCCCGCTCCATGAATCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCC 
CCCACCCC^CAACCAAAGCTGCCCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCG 
CCG^AGCTACTGGAGAACATCTGKAGCGCGCCTCGCTATCCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGG 
CCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATCAC 
GGCCTGGAGATCGT<mTACGCACTCAAGGTACG(M 

AGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGC 
AGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAG 
TCCCGCGMTACCAGCTCCACGGCGCTGCCTCAGCACTTCTC^TCCACCAACTGCTACCGCGCACTGCACTCGATTGGCT 
CAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGA 
TCATCGTGAAGGCAGACCA6GCCTCAGTTGCACA6GAGCTAGGAGGCAATCCGGATC6AGCTCTGGCTGCACCGAAGCA6
TCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCCCGTC 
TGATGGATGGCTGACCCAGGACGCACTCTGGCTOTGAGC^^ 

AGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCC 
GAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGGATGGAGAAAC^GTTCACGCTACTG^GATTGCTCC 
AGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAA 
CGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTC 
CATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCAAGAAGA 
AACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTG 
CCTCTCCTACGAACACACGCGGGGAGGAGAACCTAM 

GCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGC 
GTCCCGTCGCCTGTTCATCMCGACAC^ 

TTTTCMGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCMGGAGACGACCGGCTGGAAGCTGGTG 
CAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCC 
TC(K:AAGACGAAAGAGCTCAAGGCCTACCTGCTCCA(KJATCCCAMTTGCTGTTCCCTGAGCAGCCTCT(K}ACAACC(^ 
GCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAGCTCACCGAGGAGTACATCTTCGCGCACG 
ATCTCCGGGAAGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCTCAAGCACTTCGGTCCTACGGCAACCGTA 
CAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACGCAAGGTACTGGAACAAGGCCTGTCCAAGCGGAGCTGGAACAC 
GTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGAGTTGGTGACGCACTCCCAAGTCAACCCGTTCA 
GAAAGACCACCGTCATTCCCCCCAGGCGAGCAAGCAAAACCGTCGCCGCCGAAGCCATCCTGCTCGCCCGCAATTGGCTC 
AA^TGCAGGACGGCGCAGAGCGCTGttCCGGCGAACGCGCACGGATQCGCCCGCCTGGTTCTGGCTTTGCACGTTTGA 
GGTCTTCTACTTCACCGGCATCCGGnGAATGCGCT^^ 

TCCTCATCCGCGGCGAGACAGAGAAGACTCACAAAGAGTTCGTAGTGCCAATAACGGAGGGGCTTGTGCCTCACCTATCG 
AGGCTCCTGCAAGAGGCCGATAGAGCCGGATTCGCCGATGACGACCAGTTGTTCAACGTCAACCGGTTCTCACCGCACTA 
CAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCCATGTACCGGAAGTTGACCGAGAAGGTTGGGGTGCGGATGACCC 
CGCACCGTTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTCACCTCACGAAGTGCCTGCTC 
AACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTACGATCACATGCGTGCCGTGCTGCATGCTAGAAG 
CCTGGCCCAAGGCGCGCTGGAGAATGTCAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCAT 
GCCKmACCTCTCOTCGAGTGAGTGAAGCGCCGCtt^ 

CCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGAAGCAGATGCGCTACCACAGCCACCTGACACCTTCGAACCAAG 
CGTGCTGTTCACTCTGATGGCTCAmcm 

CAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAGCGATACCGGTACTGAGGGCCGGCTACCGGACGAAAGGTA 
GCCGTGCCTTCCAGCAGATCGTTAGGCCTGTAGGAAAAATCTGGAATTACCGAGAGCGCCTGGATTCCAGCGCCGGCATG 
CTGGCAGAGCCAGCGG^ATTTCAAGGCCAATACCACAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTAC 
GCCTGCAGAATGGTTTCAGGGCCTTAGAAACAGAAAAGCCCACCTAGAAAGGCGGGCTATTCCATATTGACATCACGTCA 
ATGCGGGCCTAATGTTCGGCCCAGACGGCTGCTAGACAAGAACCGGCGTAACACCCCTTCCTAGCCTATGCAACTCGCCC 
CGTAGAAAATGGTGGGTCGTGTAGGATTCGAACCTACGACCAATTGGTTAAAAGCCAACTGCTCTACCGACTGAGCTAAC 
GACC(^GTAT(^GGTGGTCGG(MTA(^GAGATTCGAACTCCCGACATCCTGCTCCCAAAGC^GGC(X:GCTACCGGACTC 
CGCTATACCCCGATTGGAATTTGGCTCCGCGACCTGGACTCGAACCAGGGACCCAATGATTAACAGTCATTTGCTCTACC 
GACTGAGCTATCGCGGAACGTCTTTCTTCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCA 
TTTTACGGATGCGCGCGGGCATGTCMCCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCT 
TCCACTGCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGC 
CCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGC 
CAGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGA 
AGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGG 
CGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGTCGATCCGGTT 
GATGAATTCC6GACGGAA6TGCGCATTGACCGCGTCCATCACTGCGGCAC6TTGCGCCTCGCGGTC6CCGGCCAGCTCCT
GGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGTGACTGTCGGTC 
AGGCGTCCGTCCTCGAGCACCTGGAGGAGAATGTTGAATACATCCGGATGGGCCTTCTCCACCTCGTCCAGCAGCACCAC 
CGAGTAGGGCTTGCGGCGGATCGCCTCGGTCAGGTAGCCGCCTTCCTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCA 
GGCGGGCCACCGAGTGTTTCTCCATGAACTCGGACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCG 
GCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATC 
GGCGAGGCCGGCGCGCGAACGGCGCACGGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCA 
GCTCCTGCTCCATGCGCAGCAGCTTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACC 
ACTTCGGCGATTTCCTCGTCGGTCACCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAG 
GCTGCGTTCCAGGTCCGGGATGGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCA 
TCTCCTGCTTGGCCTGCTCGATCTTCTGCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCC 
TCGAGGTCGGCGTATTCGCGCTCGAGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTC 
TTCCTTCTTCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGGCTTGGAGT 
CGATCTCCATGCGGATGCGGCTGGCGGCCICGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTAG 
CGGTGCGACAGCTTGGCCGCGGCGATGATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTTGAG 
GCttCGGA(WATGGCGATGGTGTCTTCCTC(K:TC(^ 

CCTTCTCGATGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGCCGGCTTG 
AGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTCGATGAACAGGATGAC 
CCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTTGGCACCGGCGA 
TCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCC 
A(&CCCTCGACGATGGCGGTCTT(X:CGACGCCGGCTTrc^^ 

CTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTTGGTCATGTCGA 
CGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACG 
GCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAG 
CAATACCAGCTCGCTGGAGATGAACTGGTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGTGCGAGAT 
CCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGG 
GCGGCGATATCGAAGCCGACCTGCATCAGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTG 
CACCGGCTCGATGGCCGGATGGTCATGGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCA 
AACGGTCTATTCGCATGGGTCGTCCTTCCTTCTATAGAGCGGGCCGGAACGATGGGTGTCCCTGATGAAGAAAAGCCCGC 
CGAGATGACTCAGTAGATAAGGGCGATTTTCCGCGGTTCAAGCGACCGGACCGTGACATCGGTCAGTTGCCGCCGGATAA 
CCTGCGCGGGCCTAGTCCTGGAGCCAGACCAGGCTGGCAAAACGGCCGGTACGCGACGAGCGGCGGTAGGAATAGAAGCG 
CGCGGTATCGCTGAAGGTGCAGAAGCCGCCGCCATGCACGGCGGTGACGCCATGGGCGCCCAGGCGGATCCGCGCGAGTC 
GGTAGATGTCGGCCATGAAGCGGCCCGGATTGGCGCTAGGTACGAAAGCCGAGCGCGCCTCGGCGTGCGCAGCGACGAAT 
GCATCGCGGACCTCGCCGCCGACCTCGAAGGCCTGCGGGCCGATCGCCGGCCCCAGCCAGACCAGCAGTTCGTCGCCGGG 
CACGCCCAGGCTGTCCACCGTCGCCTC(^^ 

TGCCCGAGCGGTCGCAGAACAACGCCGGCAGGCAGTCGGCGGTCATGATCGTACAGGCGACGCCCGGCATCGCGCTCCAG 
CTGGCGTCGGCCCTGAGCACCGGTTCGGGTCGGCCTCCACCACGTCACTCCGTGCACCTATTCCAACCAGCTCGGCCGGC 
ATTCCAGACGCTCGGTCAGGCGTCGGCGGTTTTATTCCACGGCGCGCGGATCGTCGTAGACGTGGGCGCCAAGGTTCAGA 
CTGTCGAAGGGTGCCTGGCTGACCCCGCCACTGCGCGTGGTCACGCAGGCCCGCACACGGGCCGGCGCCGGCCAGTCGGG 
GGTCAGCCAGGCGTTCAACCGACGAACGCCTCGCGATCCTGGCGCAACAGGCTGAGCAGCCAGAGGAATTCTTCCGGCAG 
CGGCGATTCCCACTTCATGCGCACGCCGGTGGCCGGGTGATCCAGTTCGAGGAAGCGCGCGTGCAGCGCCTGCCGGGGGA 
ATTCGCGAAGAGTCTGGACCAGGGTCTGGCTGGCCACCGGGGGAATCCTGAAGCGCCCACCGTAGACCGGATCGCCGACC 
AGGGGATAGCCAATATGGCTCATGTGCACGCGGATCTGGTGGGTACGCCCGGTCTCCAGCTTGACCCGGGTATGGGTGTG 
CGCACGGAAGCGTTCCAGCACGCGGTAATGGCTGACCGCCACCTTGCCGGCGTCGACCACCGCCATCTTCTGCCGCTGCA 
CGCCATGCCGTCCGATCGGCGCATCGATGGTGCCGCCGGAGGTGATCACGCCGATCACGATCGCCTCGTAGATGCGGCTG 
ACCGACCGTGCCTGCAGTTGCGCCACCAGCTTGGTGTGGGCCTCCAGCGTCTTGGCCACTACCATCAGGCCGGTCGTGTC 
CTTGTCCAGGCGGTGGACGATCCCGGCGCGCGGCACATTGGCGATGTCCGGGACATGGTAGAGCAAGGCATTCAGCAGGG 
TGCCGTCCTGATGGCCGGCAGCCGGATGGACCACCAGGCCGGCGGGCTTGTCAATCACCAGGATGTGCTCGTCCTCGTAG 
ACGATTTCCAGCTCGATGTCCTGTGCGAGCCACTCGCCCTGGGCTTCCTGCTCGGCCTCCAGGACCAGTTGCGCGCCGCT 
6TG6AC6ATGTCGCGCGGGC6CAGCACGGCGCCGTCGACGGTCAGGCGACCGTCCTT6ATCCA6CCG6CCAGACG6GAGC
GGGAGTGTTCGGGAAAAAGCTGGKG^GATCTGGTCGAGACGCT^CACCCAGCTCGAACGGCACCTCGGCCGCGCGT 
TGAATCATATCGGACATGAGTAGGAGACGATGCTCAGCGCGGCTTTTGGAATCGGCTACGCGCTGTGGTTAAATACGGGG 
TCTTTGTCCCAGGGGGTGCCTGGGGCGCCAATCATAACAGACGGTTGAGGCCAAGCCGACCGTCCCAGGGACGCAAGCCG 
CCATGCAAGTGAAACACCTGCTGCTGATCGCCATCCTCGCCCTCACCGCAGCCTGCTCCTCGAACAAGGAGACTGTCGAC 
GAGAACCTGAGCGAGAGCCAGCTGTACCAGCAGGCGCAGGACGACCTCAACAACAAGAGCTACAACAGCGCCGTCACCAA 
GCTGAAAGCCCTCGAATCGCGCTATCCCTTCGGCCGCTACGCCGAGCAGGCCCAGCTCGAGCTGATCTACGCCAACTACA 
AGAACATGGAGCCCGAAGCCGCCCGCGCCGCCGCCGAACGCTTCATCCGCCTGCATCCGCAGCACCCCAACGTCGACTAC 
GCCTACTACCTCAAAGGCCTGTCCTCCTTCGACCAGGACCGCGGCCTGCTGGCGCGCTTCCTGCCGCTGGACATGACCAA 

Gc<mcc<xi<^ccGa<^^ 
cg(»c(maggcgcgcatggtgtacctgcgcaacctgct<m^ 

cgccaggcctatgtcgccgccgccaaccgcggtcgctacgtggtggagaacttccaggaaaccccggccgtcggcgatgg 
cctggcgatcatggtcgaagcctaccgtcgcctgggtctcgacgacctggccagcaccagcctggaaaccctcaagctga 
actatccggataacgccagcctcaaggatggcgagttcgtcgcccgcgaaagcgaggccgacacccgctcctggctggcc 
aaggccaccctgggcctgatcgaaggcggcgagccgccgccgcacatggaaacccaggccgccaaggacgtgatcaagca 
gtacgaggatgccgagcgggagatccccgccgaactgaagccggaaaaccaggatcacagcgccgacgacgagaagccgg 
agagcgatgacgacgaggactccggtcgctcctggtggagctacatgaccttcggtctcttcgactgatcgcacgaaaca 
ccgaagggaggcgcaggcctcccttctttttgcccgccgccatgcctctcccagcgccaaacgccgcacagcctggacct 
tcccgtctgggatcgagccggcggcttggctaaactgcagctttctccagcctccgagatcaccatgggccttttccgcc 
tcctgttctggatcgccctgatcgccatcgcgttctggctctggcgtcgctttacccgtcccactccgcgccagcagcaa 
cgtccgcaggacgagccgagcgcatcgccgatggtccgctgcgcccattgcggcgtccacgtgccgcaggccaacgccct 
cgcccacgaacaacgctggtattgcagccaggcgcacctgcgccaggaccagggcgaccgtgcgcgctgaacggctacgg 
ctgagcgaggagcaggggcaacgcatcctccgtctgtaccacctgtaccgcctgaccatcggcctggtactggtcctgct 
gatctccagcgaactggaagatcaggtcctcaagctcgtccaccctgaactgttccatgtcggcagttggtgctacctgg 
tcttcaacatcctggtcgcgctgttcctgccgccgtcgcggcaattgctgccgatcttcatcctcgcgctcaccgacgtg 
ctgatgctttgcggcctgttctacgcaggtggcggcgtacccagcggcatcggcagcctgctggtggtggcggtggccat 
tgccaacatcctgctgcgcgggcgcatcggcctggtcatcgcggcggcggccagcctcggcctgctctacctgaccttct 
tcctcagcctgagcagtccggacgccaccaaccactacgtccaggccggcggcctcggcaccctgtgcttcgccgccgcg 
ctggtgatccaggctctggtgcggcgccaggagcagaccgaaacgctggccgaagaacgcgccgagacggtcgccaacct 
ggaggaactcaacgcattgatcctgcagcgcatgcgcaccggcatcctcgtggtcgatagccgtcaggccatcctcctcg 
ccaaccaggccgccctcggcctgctcaggcaggacgacgtgcagggcgccagcctcggccgccacagcccgatgctgatg 
cactgcatgaa(mtggcgcctgaatccca<mccgtccgc^ 

acccao:tttatcagcctcaaccgcgaagacgaccagcacgtgctgatcttcctcgaagacatttcgcagatcgcccagc 
aggcgcagcagatgaagctggccggtcttggccgcctgaccgccggcatcgcccatgagatccgcaacccgctgggcgcg 
atcagccacgccgcccaactgctgcaggagtcagaggaactggatgccccggaccgacgcctgacgcagatcatccagga 
ccagtcgaagcggatgaacctggtcatcgagaacgtcctgcagctctcccgtcgccgccaggccgaaccgcagcagctcg 
acctgaaggagtggcttcagcggttcgtcgacgaataccccggcaggctgcgcaacgacagccaactgcacctgcagctc 
ggtgccggcgacatccagacccgcatggacccacaccagttgaaccaggtgctgagcaacctggtgcagaacggtcttcg 
ctacagcgcccaggcgcacgggcgcggccaggtctggctgagcctcgcgcgcgacccggagagcgacctgccggtgctgg 
aagtcatcgacgacggtcccggcgtaccggcggacaaactgaacaacctgttcgaacccttctttactacagaaagcaaa 
ggcaccggcctgggcctctatctctcccgcgaactctgcgagagcaaccaggcacggatcgactaccgcaatcgcgagga 
aggcggcggctgcttccgcatcaccttcgcccacccgcgcaaactcagctgacggaagccgcacgcatgagccgacaaaa 
agccctgatcgtcgacgatgaaccggatatccgcgaactgctggaaatcactctcggccgcatgaagctggacacccgca 
gcgcccgcaacgtcaaggaagccgcgagttgctggcccgcgagccgttcgacctgtgcctcaccgacatgcgcctgccgg 
acggcagcggcctcgatctggtccagtacatccagcagcgccatccacagaccccggtggccatgatcaccgcgtacggc 
agcctggacaccgcgatccaggcgctcaaggccggtgccttcgacttcctcaccaaaccggtcgacctcggtcgcttgcg 
ggagctggtggcaaccgccctacgcttgcgcaacccggaagccgaggaagcgccggtggacaaccgcctgctcggcgagt 
cgccgccgatgcgcgccctgcgcaaccagatcggcaagctggcgcgcagccaggcgccggtctacatcagtggcgagtcc 
ggcagcggcaaggaactggtggcgcgcctgatccacgagcaggggccacgtatcgagcggccgttcgtgccggtgaactg 
cggcgcgattccctccgagctgatggaaagcgagttcttcggccacaagaaaggcagcttcactggcgctatcgaagaca 
AGCA66GCCTGTTCCA66CCGCCAGC6GT66CACCCT6TTCCTCGACGAAGTCGCCGACCT6CCGATG6CCATGCA6GTC
AAACTGCTCCGGGCGATCCAGGAAAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGTCGCACGTGCGCATCC 
TCTGCGCCACCCACAAGGACCTCGCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTACTACCGCCTCAACGTC 
ATCGAKTGCGCGTACACCGCTGCGCGAACGCC^GAKACATCCCGCTGCTCGCCGAACGCATCCTCAAGCGCCTGGCC 
GGCGACACCGGCCTGCCGGCCGCCAGGCTGACCGGCGACGCACAGGAGAAGCTGAAGAACTACCGCTTCCCGGGCAACGT 
CCGCGAGCTGGAAAACATGCTGGAGCGCGCCTATACCCTGTGCGAAGACGACCAGATCCAGCCTCACGACCTGCGCCTGG 
CCGATGCGCCGGGTGCCAGCCAGGAAGGCGCCGCGAGCCTGAGCGAAATCGACAACCTCGAGGACTACCTGGAAGACATC 
GAGCGCAAOTGATCATGCAGGttCTCGAGGAGACCCGCTGG^ 

CCGCTCGATGCGCTACCGCCTGAAAAAGCTGGGCATCGACTGAAAGTGAAAAGGCCTGTCCGAAGACAGGCCTTTTGGTT 

TTCGCTCCTCAGAGGCGACCAGCCGGGGCGTAGGGGGCCGGGTCGATGATCGGTTCCCGCCCGCTCATGAGATCCGCCAG 

CAGACGGCACGACGCCGGTGCCAGGACCAGCCCGTTGCGGTAGTGCCCGGTATTCAGCCAGAGCCCGTCGAAGCCAGGCA 

CCGGACCGATATAGGGGATGCCTTCGGGAGAGCCCGGGCGCAACCCTGCCCAGTGGGCCACCGGCTGCATGTCCGCCAGT 

TCCGGCAACAGTTCTGCCGCAGACGCCCTGAGGCTTTCCAGCGCCTCGTCGGTCGGCGTCTTGTCGAAGCCCGAATGTTC 

CAAGGTGCTGCCGATCAGGATGTGGCCGTCGCGCCGCGGAATCGCGTAGCGCCCCTTGGCCAGCACCATGCGCGGCAGGA 

AATCCGCCGCGCACTTGTAGAGGATCATCTGACCTTTCACCGGTACCACGGGCAGTTCCAGGCCAAGCGGCTTCAACAAC 

TCGCCGCTCCAGGCGCCTGCCGCCAGCAGCACCTTGTCGCCACGGATCTCGCCACGCGAGGTCGCCACGCCGACCACTCG 

ATCGCCGTCGCGCAACCAGCCGCGCACCTCCGTCTGTTCATGCAACTCGAGATTGGCGAATTGTTGCAGGGATGCCCGCA 

ATGAGCGCGCCAGGCGAGGATTGCGCACATTGGCCACGCCCGACATGTAGACCGCCCGCTGGAAGCCTGCGCCCAGCCCG 

GGCACCGCCGCGTAGGCCTCCTCGATCGGCACTTCCTTCAACGGCCGGGTGTGGTTGCGTGCCCACTGCAGTGCCTCGGT 

CTGGTCGTCCAGGTCCAGCCAGTACAGGCCAACGGTATGGACCTCGGGATCGAGCCCGGTCTCGTCGAGCAAACGCTGCC 

CCAGGGCCGGGTAGAAGTCCTGCGACCAGTGCGCCAGGGCGGTCACCGCCGGGCTGTAGCGCCACGGATAGAGCGGCGAG 

ACGATCCCGCCTCCCGCCCAGGATGCCTCACGCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCCGGCGAGCGC 

CAGCTCCCGGGCGGTCAACAGGCCGATGACGCCAGCGCCTACCACTACTACATCTCTACTCACCACAGGGCTCCTACCGA 

TTTGCCAGGAACAGAGAAATATCACTCAAAGGGATCAGATGCTGACGAATTGCCTGCTTCAACGAACTCAGTCGAATCTA 

GTCCCGGTGAAAAGCCCATCATACCCGCAGAGGTATTCATCCCATGAAATCGAGTGGTTTGAATTTGGTGGAACTATCGA 

TAGTCCTATCGATCCTTGCGATAGGCGTGACAATTGCGCTGCCCACCCTCCCCGACAGAATGAAGCGGGACATTAGCCGT 

GATATTGGTGACAGCCTGACTAGTCATGTGATGGCTGCGCGGGCTAGCAGCATACAGAACGGCGTGATCATCGAGGTGTG 

CGGTAGCGGTGACGGCAGTACCTGCAGCGAGGAATGGCATCTCGGCTGGTTCAGCCGTAACGACAGGAGCCAACAGATAC 

TGGCCCGGCATGAAAATACGAGTCGCACCGATATTCATTGGCGGGGCTTCGACAAGCGACTGCGCTACCTGCCTAATGGC 

ACCAGCCCTACAGGTAACGGGCGTTTCTTCGAATGTAAGGACGATCGCATCGAGTGGCAATTGGTGCTCAATCGGCAAGG 

CCGCCTCAGGGTGGCGGGAAAGAGCGAAAATAAAAAGCTCTCTTACCTGTGCTCCAGGCGGTGAGAGAACTGTTTCACAT 

ACCGTTTGCCAGTCATCCCACTCTCCGCTCCGGCTGTCTCTGCTACAGGGACAATGCGCTCTCCACTAGGCAAGATTATC 

TGGCCCTTTTCCTTGTGGAGTACTGCATGCGCTCTATTTGTCGCAGCGCCGGCTTTTCCCTGATCGAGTTGATGATGGTG 

TTGGTTCTGGTCGCCATATTCGCCAGCATTGCCGTACCCAGTTTCAACGCCTTGATCGAGCGCAACCGAATCCAGACTGC 

CAGCGAGGAACTCTACAGCCTGCTTCAGTACGCTCGCAGCGAAGCTGTAAACCGTCATGCCAATGTGAGCATCAGGGCGA 

CGCAGAACAATGACTGGGCAAAAGGCCTGGAAATCATCAGiCGGC^^ 

GTCTCGCTATCCGCCAGCAGTGCGACTGCGGAGCTGACCTTCAACGCTACCGGCACACTTAGCAACCAGGCTGCAAACAT 
TGACATAAAGGTCTGCTTCGCCGGTGACAAAAGTACAGGACGTCTGCTTACCGTTCAGCCCAGTGGACGCGTGATCCTGT 
ACCCATCTTCAAAGCAACCGGACAGCTGTAACTGAGGAAAGCCCATGTCTCGAGAAACGGGTTTCAGCATGATCGAAGTA 
CTGGTTGCTCTGGTGCTGATCAGCATTGGCGTACTGGGCATGGTTGCCATGCAAGGGCGCACGATCCAGTACACGCAGGA 
GTCGGTACAACGCAATGCCGCAGCAATGCTTGCTAGCGACCTGATGGAAATAATGCGTGCGGACCCAGATGCCGTACTCA 
ATCTACGCGCCCAACTACGCGAAGACTCGGTCTACTACAAGGCCAAGGGCAGCGACTTTCCCGCAGCCCCAGCGCGCTGC 
GCGCCATTGCCAGCAGATGCTAAGGAACGTCTCGGCTGCTGGGCCCAACAGGCCTCGAAAGACTTGCCGGGAGCCTCCGC 
ACTCTTGAATAGCCAATTCTACATTTGTCGCAGCCC^CCCCGGGTACCTGCGACAACACC^GGCTCGKCATCGAM 
TCCAGGTTGCCTGGCGAGCCATGGATGGAGCGTGTTTCAACGCCTCTGACTCCACCTTGTGCACCTACAGCGTCCGCTCC 
GAATTGTGAGAACAAGCATGCTCTTCAGCAAAATGCAGAAAGGCCTATCGATGGTAGAACTGCTCGTGGCACTCGCTATA 
AGCAGCTTCCTGATCCTGGGGATCAGCCAGATCTACATCGACAACAAACGCAACTATCTTTTCG^GCAAGGCCAGGCCGG 
CAACCAGGAAAATAGCCGCTTCGTTCTTATGCTGCTGCAGCAACAACTGGATAAGACAGCCTATCGTCGCCTTCACGACG 
ACAACATGGAGAATGCTTTCAAATCCGCGACATTCAATGGCTGTCGTGCATTTGTGGCTGGCGAGACTATCGCTGCGGCA 
ACTGCCCTCAAGGCGGGTGAGTACGGTGTCTGCTTGCGCTATCAACCCGCCTACAAAGGGGAGCATGATTGCCTCGGTAA 
TGAAATTACCGGAGTTCCGGAAAAGCCCTTCACAAATACTCCCCCTGTCGTCGTTCGCCTGGTCTACCTACCGAGCGCCG

GTACCCTGAGTTGCAGTCGTCCCGATATCGCCCAGTCGAAATCGGGAGAATTGGTCAGTGGTCTCACAGACTTCCGCTTG 

GAAGCGGGGGTCGGGCCAGCAGATCGTAGCGAACGCAAAGTATCCAGCTTCGTCGCACTACAGGATGTCGCCGGTCGTCC 

TATCCGAGCATTGCGCTTCTCAATCCTGGCAGGCAGCGACAATACAAGCCTGCGCACAGGAGATGATAGCCAGGCACGCG 

ATCGCTGGATCGTCCTTTATCCCGAGAGCAAAAGCGCCATCGAGGCCGCAGACAAAGGCCAGATTTACCAAATAGCGCGT 

GGTAACCAAACCATCAGGAATCTCATGCCATGACCCTGCGCCATACCTCTCGACAGCAGGGATCCACGTTGTTGATCTCG 

CTGGTTATCTTGTTGATGATCACGCTCCTCGCCGTTTCCAACATGCGCGAGGTGTCACTGGAAAGCCGTATCACCGGCAA 

TCTCATC(3AACAGMKGCCTGCGCMTGCGG(X:GMGCTGGGCTACGCGAAGGT(3AACGACGCTTTTTC^TA^ 

AGCCCCCAGAGGTCGGCAGCGGATGCGCCGATAGCAATGTCAAACGGCCTTGCATACTGAACCTGAGTGCCCTCTCCGTA 

CCCCGAGATGACGTGCACAACAATCCGGTGGCAGCCCTGAACGGCAAGACAGATAACGCCAATTCACGTGTCTGGATGCC 

CTACCGAGGCAGCGATCTGAATAACCCTACGCAGATCGACAAAGACCGCGCAGTCACCTGGCAGACCATCACGGTGCCCG 

CTGGCGAACA(MCAACGAAGCGGAAAATCCCM 

AACAGCCGCGCCCTCAACAAGGCGGGCGGAGAGACTGTTCTACAGGCCGTTCATGCACGCCTGTATACCAACTGACTGGA 
GCCAGCGCATGATCCACCAGATTACCCGCGCAGGAAAAAGCCTGCTGGCTGCAGGTTGCACCCTGAGCATCCTGTTCGCC 
TCTGACAGTTATGCCGCCACGGCCCTGAATGTCAGCCAGCAACCCCTGTTCCTAACCCAGGGCGTIGCTCCCAACCTGCT 
GTTCACTCTAGATGACTCAGGCAGTATGGCCTGGGCTTACGTGCCCGACGGTATTAGCGGGAATAGCGGCAGAGCGGGAC 
GTTCCAGCGAnACAACGCACTGTACTACAACCC^ 

CAGATCATCGTTTCCGACTATCCAGTGCCACGCTTCACAGCAGCCTGGCAGGATGGCTACGCCCAAGGCTCCACCACCAA 
CCTGAGCAATAACTATCGCCCTCAATGGGGAACCGGCTGGCTTGGTTGCATCGATAGCAGCTGCAATACCGGGAGAGCTT 
ATTACTATACTTATAAGGTAAGCGCTAGCTGCCCTGCACAGCCGGTGAGCAGCTCCAACTCCTGTTATACCTACAATGCT 
CTTCCTACCAGTCAGGAAAGCAACTTTGCGATATGGTACTCCTACTATCGCAACCGCATCCTGGCCACAAAGACCGCTGC 
CAACCTGGCCTTTTACAGCCTGCCGGAAAACGTGCGTCTCACTTGGGGGGCCCTGAACACCTGTAGCATCGGCGCCAACA 
GCAGAAGCTGCCAAAACAATGCCCTGCTCCAATTCAACAAGCAGCACAAAATCAATTTCTTCAATTGGCTGGCGAACAGC 
CCGGCCAGCGGCGGTACTCCTCTGCATGCGGCTCTTGACCGAGCCGGACGCTTCTTGCAAACCAACGGCACAGCTTATAC 
CACCGAAGACGGAAAGACATATTCCTGCCGGGCCAGCTATCACATCATGATGACCGACGGTATCTGGAACGGTCGGAACG 
TCACCCCCGGCAATCTCGACAACCAGAACCAGACCTTTCCTGATAGCACCCTCTATAGGCCACAGCCCCCTTATGCCGAC 
AGCAATGCCAGCTCATTGGCTGACCTGGCTTTCAAATACTGGACCACAGACTTACGTCCCAGCATCGACAATGACCTGAA 
GCCTTTCATGGCCTACAAGAGTGGGGACGATTCCAAGGATTACTGGGACCCTCGCMCAACCCAGCCACTTGGCAACACA 
TGGT(^CTTTACCGTTGGCCTAGGTCTTTCCTATTCGCTCACATTGAACTCTGCACCAACTTGGACAGGCAGCACCTTT 
GGCAACTACGAGGAGTTGATGGCTGGAAGCAAGGCTTGGCCCAGCGTCGATAACGACGCCGCACCCGGTAACGTCTACGA 
CCTCTGGCATGCAGCTATCAACTCTCGTGGAGACTTCTTTAGCGCGGAATCACCGGACTCTCTGGTTCAGGCTTTCAATA 
AGATCCTGACACGGATTTCCGAGCGCAACACCTCCTCCTCCAAACCAGCAATGACTTCCGCGCTGCAGGATGACGGAACC 
GGCGACAAGCTGATCCGCTACAGCTACCAGTC^ 

GGTGGAGTCGACTTCCACCGGTTCGACCAAAACCCAGGAATGGAGCGCCGGCGCACTGCTGGACAACCGAGCTCCCGCTA 
CCCGTAATATTTACATCGCCAGCAATAGCGGAACCAACCGCCTTAAGCCTTTCACATGGAGCAATATTGAGGGAAGTCAG 
TTAGCCACTTGGCTGAACCGCAACCCGGACAAGGACAATCAGGCCGACACCAAAGGAGCACAGCGGGTCGACTTCATCCG 
TGGCCAGCAGAATATGGATGGATTCCGGCAACGACAGGCGGTGTTAGGGGACATCGTGCACTCGTCTCCAGCCGTGGTCG 
GACCGGCCCAATACCTCACTTATCTGGCCAACCCCATCGAACCCAGCGGCGACTACGGCACATTCAAGACAGAGGCAGAC 
CAGCGCAGCCCTAGAGTTTATGTTGGATCCAACGATGGCATGTTGCATGGTTTCAACATCAAAACCGGCGTGGAAGAGTT 
CGCTTTCATCCCTACAGCAGTATTCGAAAAGCTTAACAAGCTTACCGGCATCAGCTACCAGGGCGGTGCCCACCAATATT 
TCGTCGACGCTACACCGGTCGTCAGCGATGCCTTTTTCGATGGAGCTTGGCACACTGTTCTGATCGGAACGCTTGGTGCT 
GGAGGTCGCGGCCTGTTCGCACTCGATGTAACCAAGCCGGACGATGTCAAGCTGCTTTGGGAATACGATAGCAGTACCGA 
CTCGGACCTTGGTTACACCTTCTCCAAACCTACCGTAGCCAGACTGCACAGCGGACAATGGGCAGTAGTTACCGGCAACG 
GCTATGGAAGCGATAATGACAAGGCAGCTTTACTGCTGATTGATTTGAAAAAGGGAACGCTGATCAAGAAGCTGGAAGTC 
CAAAGCGAGCGCGGAATAGCCAATGGCCTATCGACGCCTCGCCTGGCTGATAACAACAGCGATGGCATTGCTGACTACGC 
CTATGCTGGCGATCTGCAGGGAAATATCTGGCGCTTCGATTTGATCGGCAATACCCGCAACGACGACCCAGACACAAATA 
CCTCTATCAATCCCTTCAAGCCCGGAGATGTAGATCCTTCTGCTTTCAGAGTATCGTTCAGCGGCGCCCCGCTTTTCCGT 
GCTCGCGCCGACAACAATACTCGTCAGCCCATCAC(M3CTCCGCCTACCTTGGTAC(K:CATCCTAGCCGTAAG^CTACAT

CGTCATCGTAGGTACAGGAAAATACTTCGAGGACGATGACGCTCAGGCCGATACCAGCCGAGCCATGACGCTCTATGGTA 

TCTGGGATCGCCAGACCAAGGGCGAAAGCGCAAACAGTACCCCAACCATCGACCGCAACGCCCTCACAGCCCAAACCATG 

ACAACAGAGGCGAACTCCACATTCGGTAGCGTGAACAGGAATATTCGGCTTATTAGCCAAAACCCGGTGAAGTGGTACAA 

AGACGGAGCAACCGGTACCGCGAACTCGGATGTGGCTAGCTATGGCTGGCGACTGAATCTGGAGGTCAATAGCAGCAAGA 

AAGGCGAAATGATGATCGAAGATATGTTCGCTGCCGGCCAAGTGCTTCTATTGCAGACCTTGACACCGAACGACGACCCT 

TGTGACAGCGGCTCTACCAGCTGGACCTACGGCCTCAATCCATATACTGGCGGACGTACCAGTTTCACCGTCTTCGATCT 

CAAACGTGCGGGTATAGTGGACTCTGGCTCGGATTACAACGGCTCGGTCGTATCCGCCTTCCAACAGGATGGACTAGGTG 

GCTTGGCCATTACCCAGAACGAACAGCGTCAATCCGAGGCTTGCACTGGTGATGAGTGCATCATCTTCAACCCCAGCGAC 

AAGAGTAACGGACGACAAACCTGGCGGGTCGTCGAGGAGAAATGAACATGAACCCCTTACGTCTTCTCGCCACAGCTCTT 

GCAGCTCTAGCTCTGGCTTGCCCAACCTTTGCCTTGAGTGCCACGAATACGTTCGAGAATGTGGGCGTGGTCGAGGATGT 

TCATCCTGCCGCCGGTCTGGTAGTAGTCGATGGGCAGACATATCGCTTGCCCAACCGTGTCCAACAACAGGACTCGCCGG 

TCATATTCTTGGTACGTCAGGGACAGACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCGTTC 

TACATTATCMGCAG(K:CCCTCTCGTTCCCTTC(WATCGGAGCAGCAACAATGAAGTCGAA(^GAGGCTTCACTCTCAT 

GAGTTGATGATCGTCGTAGTAATCATCGCTATTCTTGCTGGTATCGCCTACCCCAGCTACGACGAATACGTGAAGCGCGG 

GAATCGCACCGAAGGACAGGCATTACTCAGCGAAGCAGCCGCTACTCAAGAGCGCTATTTTTCACAGAACAATACTTATA 

TCACTACCCMGCCGACATCGGCAAGCTttATA^ 

AGCCTTACCGTCGATACGGTAGCCAACGACGGAGGTTATCGCCTTATCGCTAACCAGGCATTCAACGATCTTGATTGTGG 
CAACCTGACCTTGACCGCCAACGGCGAGAAAGGCCGGACTGGAAGCAAGAAGAGCGTTGCAGAATGCTGGCGCTAAAGCG 
CCGAGACAAGAAAAAGGCAAAGCCCGGCATAAGCCGGGCTTTTTCAGGTGCGCAAAAATTCCGATTACAAAGCCTTGACC 
CGCAGTTCCTTGGGCATCGAGAAGGTAATGTTCTCCTCCCGTCCCTCCAGTTCCTGCTCTTCCGACGCCCCCCACTCACG 
TAGCTGGGCGATCACTCCGCGCACCAGCACTTCCGGCGCGGAGGCGCCTGCGGTGATTCCGATGCGACGCACACCGTCGA 
ACCAGCCGCGTTGCATGTCCTCGGCGCCGTCGATCAGGTAGGCCGGCGTGCCCATGCGCTCGGCGAGTTCGCGCAGGCGG 
TTGGAGTTGGAACTGTTGGGGCTGCCCACCACCAGGACCATGTCGCACTGGTCGGCCAGTTCCTTCACGGCATCCTGGCG 
GTTCTGGGTGGCATAGCAGATGTCGTTCTTGCGCGGCCCCTGGATCTGCGGGAACTTGGCGCGCAGGGCATCGATGACCT 
TCGAGGTGTCGTCCATCGACAGGGTGGTCTGGGTCACGTAGTGCAGGGCTTCGGGCTTGCGCACCTCCAGCGCGGCGACG 
TCGGCCTCGTCCTCCACCAGGTAGATGGCACCGCCGTTGCTGGCATCGTACTGGCCCATGGTGCCTTCCACCTCGGGGTG 
GCCTTCATGCCCGATCAGCACGCATTCGTGGCCGTCGCGGCTGTAGCGCACCACTTCCATGTGCACCTTGGTCACCAGCG 
GGCAGGTCGCGTCGAAAACCTTCAGGCCGCGCCCCTCGGCTTCCTTGCGGACCGCCTGGGAAACGCCGTGGGCGCTGAAG 
ATGACGATGACGTTGTCCGGCACCTGATCGAGTTCCTCGACGAAGATGGCGCCGCGCTGGCGCAGGTTGTCCACGACGAA 
CTTGTTGTGCACCACCTCGTGACGCACGTAGATCGGCGGGCCGAAGACATCGAGGGCACGGTTGACGATCTCGATGGCGC 
GATCCACGCCGGCGCAGAAGCCGCGGGGATTGGCGAGTTTGATTTGCATGGCGGTCTCGTGGGCGACGCGGTGATTGGAC 
GAATGAACCTTGCTACCGCCCTCCCCGCTTGGGAAGGGCGCAGCGACCGACGGTTCAGGCCGGCTGGACGTCGAT 
>ORF2 (SEQ ID NO:2)

TCGCCGATCCAATGCCAAGGAGTACCTGGGCAATCAGAGCCTACTCACGGCTGCCGGGGCCGGCATTGCCAAGCTCCTGG
ACGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGCACCAGCTTCGGGACGACCGGAACCAACAGCAACTCG 
GCCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCGC

CGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGA 
>ORF3 (SEQ ID NO:4)

CGCCGATCCAATGCCAAGGAGTACCTGGGCAATCAGAGCCTACTCACGGCTGCCGGGGCCGGCATTGCCAAGCTCCTGGA
CGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGCACCAGCTTCGGGACGACCGGAACCAACAGCAACTCGG
CCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCGCC 
GTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGAACTCAAGGGCCGCAA 
GGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAA 

>ORF602c (SEQ ID NO:6)

TCGGCCTGGTCATTCGCGGAGGCATCCTGCTGCGGATCGATTGGGCGCCGCAGCGTTTGCCTGGCATCGAGGAGCAGCCG 
GCCACGGCTGTTGCCTATCGAGCTGGTCGCACCTCGCTCCCAGACGTCGAGCATGTTGGCCTCGCCGTGGGGCAGCATCT 
CCTCCTTGCTGGTGGAGCACGCTGCCAGGGTCAGCGCGCAGGCCAGGCCGGCGCAGAGACGAAGAAGGGGGTTAGTCCAA 
GTCTGCTGTTGCATGAGCGGCTCCAGAGCTGTAATCGACCTTGCGGCCCTTGAGTTCATAGTCGATCGCCAGTTGCTGAT 
CGAGATGCACTGCGACCCGCGCACCTGGCTGCACGTAGACGGCGGCGAAGGCCTCCCCGTACAACTTGTTCATCCACTGC 
CGGATGTCGCTGACGCCGCCGGAGAGGATGCTGTTGAGGGCCGAGTTGCTGTTGGTTCCGGTCGTCCCGAAGCTGGTGCC 
GTTGCCGCTGAAGACGGTACTGGTGTTGTTCTCGTCGGCGTCCAGGAGCTTGGCAATGCCGGCCCCGGCAGCCGTGAGTA 
G 

>ORF214 (SEQ ID NO:8)

ACAAGTTGTACGGGGAGGCCTTCGCCGCCGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTG 
GCGATCGACTATGAACTCAAGGGCCGCAAGGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAACC 
CCCTTCTTCGTCTCTGCGCCGGCCTGGCCTGCGCGCTGACCCTGGCAGCGTGCTCCACCAGCAAGGAGGAGATGCTGCCC 
CACGGCGAGGCCAACATGCTCGACGTCTGGGAGCGAGGTGCGACCAGCTCGATAGGCAACAGCCGTGGCCGGCTGCTCCT 
CGATGCCAGGCAAACGCTGCGGCGCCCAATCGATCCGCAGCAGGATGCCTCCGCGAATGACCAGGCCGACTACACCCGCA 
CGGCCAGCAACGAGATCCACAGTCAGTTCAAACGACTGCCCAATCCCGACCTGGTGATGTATGTGTTCCCGCACCTGGCC 
GGCAGCGATCCCGCCCCGGTACCGGGCTACACCACCGTGTTCCCCTTCTACCAGCGAGTCCAGTACGCCATGCCGGGCGA 
ACGCACGGAGGACTATTGA 

>ORF1242c (SEQ ID NO:10) 

TCTCGTCCTGGGCGTAGAACTGGACAATCCAGGGTGAGGTTTCGTGCTCGTCGAAGGAGTTCTGCAGGGCTTCTTTCAAT 
GCGTCCCGGGCGTTCTGCATCCAATTGGGATCGCGGCCCTCGGTGCCCAAGGGCACCAGTTCGAAGAATGCGGCGCGCGA 
ACGCCCATCCTCCAGGAGCATCACTTGCTCGTCGGGCAGGTACTCAGCCCAGGGCAGCAGGTCTACGAACGATGGGTCGT 
GATCGTAGAGGCGTGACGCTTCGGCCTGCGTGGCGCCATTCTTGCTCCCGGTGTTGGGCAGAGGAATACCCATGGCGGCC 
AGTCGCGCCAGATAGCGCTCAGTCGCTTCTTCCGCGGCCGCTACGTCCAGCGCTCCTGAATCTTCGGGAGCGTCTGCCGG 
TACCGACTGAGGCTGTGTGCGACCGCGCAGAAGGGTTTGAAAAAAGCCCATCAATAGTCCTCCGTGCGTTCGCCCGGCAT 
GGCGTACTGGACTCGCTGGTAGAAGGGGAACACGGTGGTGTAGCCCGGTACCGGGGCGGGATCGCTGCCGGCCAGGTGCG 
GGAACACATACATCACCAGGTCGGGATTGGGCAGTCGTTTGAACTGACTGTGGATCTCGTTGCTGGCCGTGCGGGTGTAG 
TCGGCCTGGTCATTCGCGGAGGCATCCTGCTGCGGATCGATTGGGCGCCGCAGCGTTTGCCTGGCATCGAGGAGCAGCCG 
GCCACGGCTGTTGCCTATCGAGCTGGTCGCACCTCGCTCCCAGACGTCGAGCATGTTGGCCTCGCCGTGGGGCAGCATCT 
CCTCCTTGCTGGTGGAGCACGCTGCCAGGGTCAGCGCGCAGGCCAGGCCGGCGCAGAGACGAAGAAGGGGGTTAGTCCAA 
GTCTGCTGTTGCATGAGCGGCTCCAGAGCTGTAATCGACCTTGCGGCCCTTGA 
>ORF594 (SEQ ID NO:12)

CCAGGCCGACTACACCCGCACGGCCAGCAACGAGATCCACAGTCAGTTCAAACGACTGCCCAATCCCGACCTGGTGATGT
ATGTGTTCCCGCACCTGGCCGGCAGCGATCCCGCCCCGGTACCGGGCTACACCACCGTGTTCCCCTTCTACCAGCGAGTC 
CAGTACGCCATGCCGGGCGAACGCACGGAGGACTATTGATGGGCTTTTTTCAAACCCTTCTGCGCGGTCGCACACAGCCT 
CAGTCGGTACCGGCAGACGCTCCCGAAGATTCAGGAGCGCTGGACGTAGCGGCCGCGGAAGAAGCGACTGAGCGCTATCT 
GGCGCGACTGGCCGCCATGGGTATTCCTCTGCCCAACACCGGGAGCAAGAATGGCGCCACGCAGGCCGAAGCGTCACGCC 
TCTACGATCACGACCCATCGTTCGTAGACCTGCTGCCCTGGGCTGAGTACCTGCCCGACGAGCAAGTGATGCTCCTGGAG 
GATGGGCGTTCGCGCGCCGCATTCTTCGAACTGGTGCCCTTGGGCACCGAGGGCCGCGATCCCAATTGGATGCAGAACGC 
CCGGGACGCATTGAAAGAAGCCCTGCAGAACTCCTTCGACGAGCACGAAACCTCACCCTGGATTGTCCAGTTCTACGCCC 
AGGACGAGATCAGCTGGGACAATTTCCAGGAGCAGTTGAGGCAGTACGTCCATCCTCGAGCGCGAGGATCGGCCTTCAGC 
GAGATGTACCTGGCGCTCATGAAGCATCACCTGGAGGGCATTTCGAAGCCGGGCGGACTGTTCGTCGACACCGCCGTCAG 
CAAGCTGCCCTGGCGAGGACAACAGCGCCGCGTGCGGATGGTCGTCTACCGCCGGATCCGCAAGGAGGATGCGCAGATTC 
GCGGACAGGACCCGGCGGCGTACCTGAAATCCATCTGCGAGCGTATCCAAGGCGGCCTGGCGAACGCCGGCATCGTCGCT 
TCGCGCATGGGCGGACAGGAGATCAGGAACTGGTTGATCCGCTGGTTCAACCCGCACCCGGATCACCTCGGCCAGGCCGA 
GGCGGACCTACGTCGCTTCTACGAACTGGTATGCCGTCCGGACGAACCGATCCTGCAGGATGAATTGCCACTGGCCGACG 
GCACTGACTTCTCCCAGAACCTGTTCTATCGGCAGCCTGTTTCCGATGCCACCCAGGGCGTATGGCTCTTCGATGCCATG 
CCGCACCGAGTGATTGTGGTCGACCAGTTGAACAAAGCGCCGCTGACAGGCCACTTCACCGGCGAGACGCTCAAAGGCGA 
TGGCCTCAACGCCCTGTTCGATCGAATGCCCGAGGACACGCTGCTGTGCATCACCATGGTCGTGACGCCGCAGGACATGC 
TGGAAGGGCATCTGCAGCAGCTCTCGAAAAAGGCCGTTGGTGACACCCAGGCCTCGATCCACACCCGCGAGGACGTGGCC 
ACCGTTCGACGCCTGATCGGCCGGGAGCACAAGCTCTATCGCGGAGCGATCGCTCTGTTCGTGCGCGGCCGCGACCATAC 
CCAGTTGGAGGAACGCTGCATCACCCTGAGCAACGTACTGCTCGGCGCCGGCCTGGTGCCGGTCGAACCGCAGAACGAAG 
TCGGACCGCTGAACAGCTACCTGCGCTGGCTCCCCTCAAACTTCGATCCAAACGAGAAGCGAGCCCTGGAGTGGTACACC 
CAGATGATGTTCGCTCAGCACATCGCCAACCTGTCGCCCATCTGGGGGCGCACCACCGGTACCGGACACCCTGGCTTCAC 
GCTGTTCAACCGTGGCGGCGCGCCGTTGACCTTCGACCCGTTCAACAAGCTGGACCGGCAGATGAATGCCCACGGCTTCA 
TCTTCGGGCCAACTGGCTCCGGCAAGTCGGCGTCCCTGACCAACCTCATCTGCCAGATGCTCGCCATGTACCTGCCGCGG 
ATGTTCGTCGCGGAAGCGGGCAACAGCTTCGGCCTGCTGGCCGACTTAGCCAAGCGGTTTGGCCTCTCGGTCCACCGGGT 
GCGCCTCGCCCCGGGCTCCGGCGTCAGCCTGGCGCCGTTCGCGGACGCCATCAAGCTGGTCGAGAGCCCCGACCAAGTGA 
AGGTGCTGGACGCCGAAGACATCGAGGCCTCGGACTCGGTCCAGGGCAGCAAGGCCGACCTCGAGGACGACCAGCGAGAC 
ATCCTGGGCGAGATGGAGATCGTCGCCCGCCTCATGATTACCGGTGGCGAAGAGAAGGAAGATGCGCGCCTGACCCGTGC 
CGATCGCAGCGCCGTCCGCCAGGCGATCCTGGCGGCGGCCAGGACCTGCGCCGCCGCGAACCGCACGGTACTGACCCAAG 
ACGTGCGCGATGCGCTCTACGAGGCCTCCAGGAGCGATAGCACCGCGCCAGAACGCCGCGCGCGGATCGCCGAAATGGCG 
GAAGCCATGCAGATGTTCTGCATGGGCGCCGACGGCGAGATGTTCAATCGCGAAGGCACGCCCTGGCCTGAGGCCGACCT
TACCGTGGTGGATTTCGCAACGTACGCGCGCGAAGGCTACGCCGCCCAGCTCGGGATCGCCTACATCTCGCTGCTGAACA 
CCGTGAACAACATCGCCGAACGCGACCAGTTCAAGGGCCGGCCAATCGTCAAGATCACCGATGAGGGGCACATCATCACC 
AAGCACCCGCTGCTGCTGCCCTACGCCATGAAGATCACCAAGATGTGGCGGAAACTGGGCGCCTGGTTCTGGCTCGCCAC 
CCAGAACATCGACGACATCCCAGCCTCCGGGGCGCCGATGCTGAACATGATCGAGTGGTGGTTGTGCCTGAACATGCCCC
CCGACGAAGTAGAGAAGATATCCAGGTTCCGCGAGCTGTCGCCGGCGCAGAAGTCGATGATGCTCTCGGCCCGCAAGGAA 
AGCGGCAAGTTCACCGAGGGCGTGCTCCTGGCCAAGGGCAAAGAATACCTCGTCCGTGTGGTTCCCCCGAGTCTCTACCT 
GGCCCTGGCCATGACCGAAAACGAAGAAAAGAACCAGCGCTACAACATCATGCAAGCCACCGGCTGCGACGAGCTCGAGG 
CGGCCTTGCAGGTCGCAGCGGATCTCGACAAGGCGCGCGGCCTGCCACCCTTCCCCATTGTTTTCCCAGACCAACCGGCA 
GTGGAGTGCCAGGACGAATGA 

>ORF1040 (SEQ ID NO:14) 

GTACCTGCCCGACGAGCAAGTGATGCTCCTGGAGGATGGGCGTTCGCGCGCCGCATTCTTCGAACTGGTGCCCTTGGGCA 
CCGAGGGCCGCGATCCCAATTGGATGCAGAACGCCCGGGACGCATTGAAAGAAGCCCTGCAGAACTCCTTCGACGAGCAC 
GAAACCTCACCCTGGATTGTCCAGTTCTACGCCCAGGACGAGATCAGCTGGGACAATTTCCAGGAGCAGTTGAGGCAGTA 
CGTCCATCCTCGAGCGCGAGGATCGGCCTTCAGCGAGATGTACCTGGCGCTCATGAAGCATCACCTGGAGGGCATTTCGA 
AGCCGGGCGGACTGTTCGTCGACACCGCCGTCAGCAAGCTGCCCTGGCGAGGACAACAGCGCCGCGTGCGGATGGTCGTC 
TACCGCCGGATCCGCAAGGAGGATGCGCAGATTCGCGGACAGGACCCGGCGGCGTACCTGAAATCCATCTGCGAGCGTAT 
CCAAGGCGGCCTGGCGAACGCCGGCATCGTCGCTTCGCGCATGGGCGGACAGGAGATCAGGAACTGGTTGATCCGCTGGT 
TCAACCCGCACCCGGATCACCTCGGCCAGGCCGAGGCGGACCTACGTCGCTTCTACGAACTGGTATGCCGTCCGGACGAA 
CCGATCCTGCAGGATGA 
>ORF1640c (SEQ ID NO:16)

GTCCGCCTCGGCCTGGCCGAGGTGATCCGGGTGCGGGTTGAACCAGCGGATCAACCAGTTCCTGATCTCCTGTCCGCCCA
TGCGCGAAGCGACGATGCCGGCGTTCGCCAGGCCGCCTTGGATACGCTCGCAGATGGATTTCAGGTACGCCGCCGGGTCC 
TGTCCGCGAATCTGCGCATCCTCCTTGCGGATCCGGCGGTAGACGACCATCCGCACGCGGCGCTGTTGTCCTCGCCAGGG 
CAGCTTGCTGACGGCGGTGTCGACGAACAGTCCGCCCGGCTTCGAAATGCCCTCCAGGTGATGCTTCATGAGCGCCAGGT 
ACATCTCGCTGAAGGCCGATCCTCGCGCTCGAGGATGGACGTACTGCCTCAACTGCTCCTGGAAATTGTCCCAGCTGATC 
TCGTCCTGGGCGTAGAACTGGACAATCCAGGGTGA 

>ORF2228c (SEQ ID NO:18)

GGGGAGCCAGCGCAGGTAGCTGTTCAGCGGTCCGACTTCGTTCTGCGGTTCGACCGGCACCAGGCCGGCGCCGAGCAGTA 
CGTTGCTCAGGGTGATGCAGCGTTCCTCCAACTGGGTATGGTCGCGGCCGCGCACGAACAGAGCGATCGCTCCGCGATAG 
AGCTTGTGCTCCCGGCCGATCAGGCGTCGAACGGTGGCCACGTCCTCGCGGGTGTGGATCGAGGCCTGGGTGTCACCAAC 
GGCCTTTTTCGAGAGCTGCTGCAGATGCCCTTCCAGCATGTCCTGCGGCGTCACGACCATGGTGATGCACAGCAGCGTGT 
CCTCGGGCATTCGATCGAACAGGGCGTTGAGGCCATCGCCTTTGAGCGTCTCGCCGGTGAAGTGGCCTGTCAGCGGCGCT 
TTGTTCAACTGGTCGACCACAATCACTCGGTGCGGCATGGCATCGAAGAGCCATACGCCCTGGGTGGCATCGGAAACAGG 
CTGCCGATAGAACAGGTTCTGGGAGAAGTCAGTGCCGTCGGCCAGTGGCAATTCATCCTGCAGGATCGGTTCGTCCGGAC 
GGCATACCAGTTCGTAGAAGCGACGTAG 

>ORF2068c (SEQ ID NO:20) 

AGCTTGTGCTCCCGGCCGATCAGGCGTCGAACGGTGGCCACGTCCTCGCGGGTGTGGATCGAGGCCTGGGTGTCACCAAC 
GGCCTTTTTCGAGAGCTGCTGCAGATGCCCTTCCAGCATGTCCTGCGGCGTCACGACCATGGTGATGCACAGCAGCGTGT 
CCTCGGGCATTCGATCGAACAGGGCGTTGAGGCCATCGCCTTTGAGCGTCTCGCCGGTGAAGTGGCCTGTCAGCGGCGCT 
TTGTTCAACTGGTCGACCACAATCACTCGGTGCGGCATGGCATCGAAGAGCCATACGCCCTGGGTGGCATCGGAAACAGG 
CTGCCGATAG 

>ORF1997 (SEQ ID NO:22) 

CACCCAGGCCTCGATCCACACCCGCGAGGACGTGGCCACCGTTCGACGCCTGATCGGCCGGGAGCACAAGCTCTATCGCG 
GAGCGATCGCTCTGTTCGTGCGCGGCCGCGACCATACCCAGTTGGAGGAACGCTGCATCACCCTGAGCAACGTACTGCTC 
GGCGCCGGCCTGGTGCCGGTCGAACCGCAGAACGAAGTCGGACCGCTGAACAGCTACCTGCGCTGGCTCCCCTCAAACTT 
CGATCCAAACGAGAAGCGAGCCCTGGAGTGGTACACCCAGATGATGTTCGCTCAGCACATCGCCAACCTGTCGCCCATCT 
GGGGGCGCACCACCGGTACCGGACACCCTGGCTTCACGCTGTTCAACCGTGGCGGCGCGCCGTTGACCTTCGACCCGTTC 
AACAAGCTGGACCGGCAGATGAATGCCCACGGCTTCATCTTCGGGCCAACTGGCTCCGGCAAGTCGGCGTCCCTGACCAA 
CCTCATCTGCCAGATGCTCGCCATGTACCTGCCGCGGATGTTCGTCGCGGAAGCGGGCAACAGCTTCGGCCTGCTGGCCG 
ACTTAGCCAAGCGGTTTGGCCTCTCGGTCCACCGGGTGCGCCTCGCCCCGGGCTCCGGCGTCAGCCTGGCGCCGTTCGCG 
GACGCCATCAAGCTGGTCGAGAGCCCCGACCAAGTGAAGGTGCTGGACGCCGAAGACATCGAGGCCTCGGACTCGGTCCA 
GGGCAGCAAGGCCGACCTCGAGGACGACCAGCGAGACATCCTGGGCGAGATGGAGATCGTCGCCCGCCTCATGATTACCG 
GTGGCGAAGAGAAGGAAGATGCGCGCCTGACCCGTGCCGATCGCAGCGCCGTCCGCCAGGCGATCCTGGCGGCGGCCAGG 
ACCTGCGCCGCCGCGAACCGCACGGTACTGACCCAAGACGTGCGCGATGCGCTCTACGAGGCCTCCAGGAGCGATAG 

>ORF2558c (SEQ ID NO:24) 

GTCGGCCAGCAGGCCGAAGCTGTTGCCCGCTTCCGCGACGAACATCCGCGGCAGGTACATGGCGAGCATCTGGCAGATGA 
GGTTGGTCAGGGACGCCGACTTGCCGGAGCCAGTTGGCCCGAAGATGAAGCCGTGGGCATTCATCTGCCGGTCCAGCTTG 
TTGAACGGGTCGAAGGTCAACGGCGCGCCGCCACGGTTGAACAGCGTGAAGCCAGGGTGTCCGGTACCGGTGGTGCGCCC 
CCAGATGGGCGACAGGTTGGCGATGTGCTGAGCGAACATCATCTGGGTGTACCACTCCAGGGCTCGCTTCTCGTTTGGAT 
CGAAGTTTGA 
>ORF2929c (SEQ ID NO:26)

AGCGCATCGCGCACGTCTTGGGTCAGTACCGTGCGGTTCGCGGCGGCGCAGGTCCTGGCCGCCGCCAGGATCGCCTGGCG
GACGGCGCTGCGATCGGCACGGGTCAGGCGCGCATCTTCCTTCTCTTCGCCACCGGTAATCATGAGGCGGGCGACGATCT 
CCATCTCGCCCAGGATGTCTCGCTGGTCGTCCTCGAGGTCGGCCTTGCTGCCCTGGACCGAGTCCGAGGCCTCGATGTCT 
TCGGCGTCCAGCACCTTCACTTGGTCGGGGCTCTCGACCAGCTTGATGGCGTCCGCGAACGGCGCCAGGCTGACGCCGGA 
GCCCGGGGCGAGGCGCACCCGGTGGACCGAGAGGCCAAACCGCTTGGCTAAGTCGGCCAGCAGGCCGAAGCTGTTGCCCG 
CTTCCGCGACGAACATCCGCGGCAGGTACATGGCGAGCATCTGGCAGATGAGGTTGGTCAGGGACGCCGACTTGCCGGAG 
CCAGTTGGCCCGAAGATGAAGCCGTGGGCATTCATCTGCCGGTCCAGCTTGTTGAACGGGTCGAAGGTCAACGGCGCGCC 
GCCACGGTTGAACAGCGTGAAGCCAGGGTGTCCGGTACCGGTGGTGCGCCCCCAGATGGGCGACAGGTTGGCGATGTGCT 
GA 

>ORF3965c (SEQ ID NO:28) 

GCGCCTGTTGGGCCGTATCAGGCTGTGGATGTTGTTGCAGCCATTCATCCAAGAGCTGCTTTATCTGCGGGACGATATCC 
CGGCGATCGACTGCCCTCAGTTGAATCTGCTGCAGCTCCTCTATCAGTACAGGAGCGCATATCCTTAGCGTCTGCAGGGC 
ATCCTCTTCGGGGTTCTGCAGGATCTGGGTCAGGTTGTCGATCAGGTTCTGGGTCAGCGAATTCAGAACTCTCATTCGTC 
CTGGCACTCCACTGCCGGTTGGTCTGGGAAAACAATGGGGAAGGGTGGCAGGCCGCGCGCCTTGTCGAGATCCGCTGCGA 
CCTGCAAGGCCGCCTCGAGCTCGTCGCAGCCGGTGGCTTGCATGATGTTGTAGCGCTGGTTCTTTTCTTCGTTTTCGGTC 
ATGGCCAGGGCCAGGTAGAGACTCGGGGGAACCACACGGACGAGGTATTCTTTGCCCTTGGCCAGGAGCACGCCCTCGGT 
GAACTTGCCGCTTTCCTTGCGGGCCGAGAGCATCATCGACTTCTGCGCCGGCGACAGCTCGCGGAACCTGGATATCTTCT 
CTACTTCGTCGGGGGGCATGTTCAGGCACAACCACCACTCGATCATGTTCAGCATCGGCGCCCCGGAGGCTGGGATGTCG 
TCGATGTTCTGGGTGGCGAGCCAGAACCAGGCGCCCAGTTTCCGCCACATCTTGGTGATCTTCATGGCGTAGGGCAGCAG 
CAGCGGGTGCTTGGTGATGATGTGCCCCTCATCGGTGATCTTGACGATTGGCCGGCCCTTGAACTGGTCGCGTTCGGCGA 
TGTTGTTCACGGTGTTCAGCAGCGAGATGTAGGCGATCCCGAGCTGGGCGGCGTAGCCTTCGCGCGCGTACGTTGCGAAA 
TCCACCACGGTAAGGTCGGCCTCAGGCCAGGGCGTGCCTTCGCGATTGAACATCTCGCCGTCGGCGCCCATGCAGAACAT 
CTGCATGGCTTCCGCCATTTCGGCGATCCGCGCGCGGCGTTCTGGCGCGGTGCTATCGCTCCTGGAGGCCTCGTAGAGCG 
CATCGCGCACGTCTTGGGTCAGTACCGTGCGGTTCGCGGCGGCGCAGGTCCTGGCCGCCGCCAGGATCGCCTGGCGGACG 
GCGCTGCGATCGGCACGGGTCAGGCGCGCATCTTCCTTCTCTTCGCCACCGGTAATCATGAGGCGGGCGACGATCTCCAT 
CTCGCCCAGGATGTCTCGCTGGTCGTCCTCGAGGTCGGCCTTGCTGCCCTGGACCGAGTCCGAGGCCTCGATGTCTTCGG 
CGTCCAGCACCTTCACTTGGTCGGGGCTCTCGACCAGCTTGATGGCGTCCGCGAACGGCGCCAGGCTGACGCCGGAGCCC 
GGGGCGAGGCGCACCCGGTGGACCGAGAGGCCAAACCGCTTGGCTAA 

>ORF3218 (SEQ ID NO:30) 

GGGGCACATCATCACCAAGCACCCGCTGCTGCTGCCCTACGCCATGAAGATCACCAAGATGTGGCGGAAACTGGGCGCCT 
GGTTCTGGCTCGCCACCCAGAACATCGACGACATCCCAGCCTCCGGGGCGCCGATGCTGAACATGATCGAGTGGTGGTTG 
TGCCTGAACATGCCCCCCGACGAAGTAGAGAAGATATCCAGGTTCCGCGAGCTGTCGCCGGCGCAGAAGTCGATGATGCT 
CTCGGCCCGCAAGGAAAGCGGCAAGTTCACCGAGGGCGTGCTCCTGGCCAAGGGCAAAGAATACCTCGTCCGTGTGGTTC 
CCCCGAGTCTCTACCTGGCCCTGGCCATGACCGAAAACGAAGAAAAGAACCAGCGCTACAACATCATGCAAGCCACCGGC 
TGCGACGAGCTCGAGGCGGCCTTGCAGGTCGCAGCGGATCTCGACAAGGCGCGCGGCCTGCCACCCTTCCCCATTGTTTT 
CCCAGACCAACCGGCAGTGGAGTGCCAGGACGAATGAGAGTTCTGAATTCGCTGACCCAGAACCTGATCGACAACCTGAC 
CCAGATCCTGCAGAACCCCGAAGAGGATGCCCTGCAGACGCTAAGGATATGCGCTCCTGTACTGATAGAGGAGCTGCAGC 
AGATTCAACTGAGGGCAGTCGATCGCCGGGATATCGTCCCGCAGATAAAGCAGCTCTTGGATGAATGGCTGCAACAACAT 
CCACAGCCTGATACGGCCCAACAGGCGCTCATTGAGGCCGTGGACCGCGCGGAGATCCTACAGCGGAGGCAAGCGTGA 

>ORF3568 (SEQ ID NO:32) 

CCGAAAACGAAGAAAAGAACCAGCGCTACAACATCATGCAAGCCACCGGCTGCGACGAGCTCGAGGCGGCCTTGCAGGTC 
GCAGCGGATCTCGACAAGGCGCGCGGCCTGCCACCCTTCCCCATTGTTTTCCCAGACCAACCGGCAGTGGAGTGCCAGGA 
CGAATGAGAGTTCTGAATTCGCTGACCCAGAACCTGATCGACAACCTGACCCAGATCCTGCAGAACCCCGAAGAGGATGC 
CCTGCAGACGCTAAGGATATGCGCTCCTGTACTGATAGAGGAGCTGCAGCAGATTCAACTGAGGGCAGTCGATCGCCGGG 
ATATCGTCCCGCAGATAAAGCAGCTCTTGGATGA 
>ORF4506c (SEQ ID NO:34)

GTCAATAAGTTCGTTGTCTTTCGCACATTTCTCCAGTCGAGCCTGGTCCAGTTCAGGAAAGTCCAATGTGCCGCCAGGCA
GCCCGCCCCCGTTGCCGGCCGACTGAGCGAAGATCGCATCGATAGCGCTCCAGAAGGCTTTGGCGCCGCCTTGGATCCCC 
GCGCACTCCACCAGGCGAGCCTGGTGGCGGGCCGCCTCGCCATGCATCTGCAGGGGAAGATGGCGCCAAACCAGGTTCAC 
GTCCGGATGGCTGTCTACCCAGCGCTTAAGCCGCGGGGTGTAGACCTTGCAGAAGGGGCACTCCAGGTCGGCGTATTCAT 
TGATCGTCCAGCGCGCTTTCGCATCGCCGTAGAGGCTGTGGTTGGCTGGCAGGCCCTTCACCAGAAGCTCTACCCCTACG 
GCGGATGCAGCCAGCAAGACCAGCAGCAGCCCCGCCCAGGGCAGGGCGGGACCTTGAAATCGTTTGGCTGCCCAGCCGCC 
CTTCAAGAGTCTCACGCTTGCCTCCGCTGTAGGATCTCCGCGCGGTCCACGGCCTCAATGAGCGCCTGTTGGGCCGTATC 
AGGCTGTGGATGTTGTTGCAGCCATTCATCCAAGAGCTGCTTTATCTGCGGGACGATATCCCGGCGATCGACTGCCCTCA 

GTTGA 

>ORF3973 (SEQ ID NO:36)

GGCCGTGGACCGCGCGGAGATCCTACAGCGGAGGCAAGCGTGAGACTCTTGAAGGGCGGCTGGGCAGCCAAACGATTTCA 

AGGTCCCGCCCTGCCCTGGGCGGGGCTGCTGCTGGTCTTGCTGGCTGCATCCGCCGTAGGGGTAGAGCTTCTGGTGAAGG 

GCCTGCCAGCCAACCACAGCCTCTACGGCGATGCGAAAGCGCGCTGGACGATCAATGAATACGCCGACCTGGAGTGCCCC 

TTCTGCAAGGTCTACACCCCGCGGCTTAAGCGCTGGGTAGACAGCCATCCGGACGTGAACCTGGTTTGGCGCCATCTTCC 

CCTGCAGATGCATGGCGAGGCGGCCCGCCACCAGGCTCGCCTGGTGGAGTGCGCGGGGATCCAAGGCGGCGCCAAAGCCT

TCTGGAGCGCTATCGATGCGATCTTCGCTCAGTCGGCCGGCAACGGGGGCGGGCTGCCTGGCGGCACATTGGACTTTCCT 
GAACTGGACCAGGCTCGACTGGAGAAATGTGCGAAAGACAACGAACTTATTGACTCAGATATCAAGTTGGACATCGACAT 
TGCACGGTCGAAGGGCATTACAGCGACCCCGACCCTCGTCATCCGGGACAACCAGACGGGACGAAGCGTGAAGCTTGAAG 
GCATGGCCGACGAGACCACGTTGCTGTCGGCGATAGACTGGCTAGCCAAGGATCTCTAG 

>ORF4271 (SEQ ID NO:38) 

ACCTGGTTTGGCGCCATCTTCCCCTGCAGATGCATGGCGAGGCGGCCCGCCACCAGGCTCGCCTGGTGGAGTGCGCGGGG 
ATCCAAGGCGGCGCCAAAGCCTTCTGGAGCGCTATCGATGCGATCTTCGCTCAGTCGGCCGGCAACGGGGGCGGGCTGCC
TGGCGGCACATTGGACTTTCCTGAACTGGACCAGGCTCGACTGGAGAAATGTGCGAAAGACAACGAACTTATTGACTCAG 
ATATCAAGTTGGACATCGACATTGCACGGTCGAAGGGCATTACAGCGACCCCGACCCTCGTCATCCGGGACAACCAGACG 

GGACGAAGCGTGA 
>ORF4698 (SEQ ID NO:40)

GAAATCGGCGAGGATTCCAACATCCCTCTTTTGGTCCTCCAGGATGCCCTGCACTTCACCTGGCAGAACCTCGACCTCCT 
CCCCATCCACAATCTTTACCATTCTCTTGTGGCCGGAGCTGGTGAGGCTAAGCCTCAACTCCATTGCCGGCCGAGCATTG 
ATGTAAATGCTCTCGAGCAAGCGCTCCATGACTTCGACCACTCCTTAATATCAGTTAGCCAGCTACATACAGGAATTATG 
CTACCCAGGACATGCAGGCGTCACCCCTACTTATGTACGTGGCAGCGTTCGATCACGGCTCGAAAAAATACACCACCTAC 

GAGTTGA 

>ORF5028 (SEQ ID NO:42) 

TTTCCTGCTGCCCTATCGGAAGTGATCCTGTCTGCTGTCTGTACCTTTCTAGAACCGGTACAGACCCATGCCTCTTCATC 
ACTCCCCCCCTGGCCGGCGGCCACCAACGCTGGCCGTTGGCGTACTACTGGTACTGCTGAGCAGCGCGAGTCAGGCCGAA 
ACCTGGGTCATCACCGACAAGGCTCATCCGGTCTCTGCCACCGGATCGTCGCGCGTTCTGTTTCTGGACGCCCAGGAACA 
CCTCGAGGAGCAACTGACTGCGGCCTTGCCCCAGGATCCACAGCATGCTCAAGCGGCGTTTAA 

>ORF5080 (SEQ ID NO:44) 

AACCGGTACAGACCCATGCCTCTTCATCACTCCCCCCCTGGCCGGCGGCCACCAACGCTGGCCGTTGGCGTACTACTGGT 
ACTGCTGAGCAGCGCGAGTCAGGCCGAAACCTGGGTCATCACCGACAAGGCTCATCCGGTCTCTGCCACCGGATCGTCGC 
GCGTTCTGTTTCTGGACGCCCAGGAACACCTCGAGGAGCAACTGACTGCGGCCTTGCCCCAGGATCCACAGCATGCTCAA 
GCGGCGTTTAAGCGATTGCTACAAAGCCCCGATGGGCGCCGCCTGCAGGCAGAGCTGGTCAAGGCACAACAAGACGTCGC 
CGATGCGTGGAGTCTCGGTGTCGAGAAGATCCCTGCCGTAGTAGTCGATAGGCAGTACGTGGTCTACGGCGAACCGGATG 
TTTCGCGCGCTCTTGAGCTAATCGCCAAGGCCAGGAGGTCGCGCTGA 
>ORF6479c (SEQ ID NO:46)

ttcgtctccgtgtccttattggaagtcggtactgcagatgaacatctgccccttgcgctggcagcaggagtagggacgcc
agagcgcccaggcgtgctccccgtcgacggcttgcgtcttcggcccagagttgggaaacaccgcgcagttgaggctcagg 
gatggggtcagctcctgccatttcccggtcgaggcatcgccctctttcagctcgcccgccggccagtagccgtccttggg

ggctgcgcgcatggggaggtagacgtggagctggccgattcgcgtggtgatatcgccggcgcgctgggcgatgacggctg 
ccgtcttgtagtcgtcggtctggtgcaggaagccgctgcgcggatagaggttcccccacatgtcgccggagaagattcca 
cccacctcgcgcagccctgggaccaacgcttcggggtacacctgctcgggaattccatgccgccagccaatggcgtccag

tgtgctgagaaagtacggcaccagcgggacggtggcgccagggcaaacgtacccagaggcgctggcgaaccggctgaacg 
tggcgccaccaggatggccgatcacatccgcttccttgaagcggccgatgctgttctcggccttgtagtttgtggtcgcg 
tcattgccggcctgggcgagtggattcggtgtacccagcgccgatacctcggtccaggggttgctcccggtattcgcgta 
gctggagacgactgcgtcaggcacgtagtggcggaccttgaccgacgtcttcactttgcagccatgcgggccgcagagca 
gccagtaacagatcccgacgaccttgtattcgaggcactgaggggaaagggtggaggagacgatggcagcgctgttgatc 
gcgkcgaggccgtgaacgagaggctgaaggtgkggccgcc^tgccaggcggccksaggttgaggctggtcatcagcgc 
gacctcctggccttggcgattagctcaagagcgcgcgaaacatccggttcgccgtag 

>ORF5496 (SEQ ID NO:48)

gctaatcgccaa(3gcca(waggtcgcgctgatgac(agot 

cctctcgttcacggcctc^cgcgatm 

aggtcgtcgggatctgttactggctgctctgcggcccgcatggctgcaaagtgaagacgtcggtcaaggtccgccactac 

gtgcctgacgcagtcgtctccagctacgcgaataccgggagcaacccctggaccgaggtatcggcgctgggtacaccgaa 

tccactcgcccaggccggcaatgacgcgaccacaaactacaaggccgagaacagcatcggccgcttcaaggaagcggatg

tgatcggccatcctggtggcgccacgttcagccggttcgccagcgcctctgggtacgtttgccctggcgccaccgtcccg 

ctggtgccgtactttctcagcacactggacgccattggctggcggcatggaattcccgagcaggtgtaccccgaagcgtt 

ggtcccagggctgcgcgaggtgggtggaatcttctccggcgacatgtgggggaacctctatccgcgcagcggcttcctgc 

accagaccgacgactacaagacggcagccgtcatcgcccagcgcgccggcgatatcaccacgcgaatcggccagctccac

gtctacctccccatgcgcgcagcccccaaggacggctactggccggcgggcgagctgaaagagggcgatgcctcgaccgg 

gaaatggcaggagctgaccccatccctgagcctcaactgcgcggtgtttcccaactctgggccgaagacgcaagccgtcg 

acggggagcacgcctgggcgctctggcgtccctactcctgctgccagcgcaaggggcagatgttcatctgcagtaccgac

ttccaataa 
>ORF5840 (SEQ ID NO:50)

CGCGACCACAAACTACAAGGCCGAGAACAGCATCGGCCGCTTCAAGGAAGCGGATGTGATCGGCCATCCTGGTGGCGCCA
CGTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCGCTGGTGCCGTACTTTCTCAGCACA
CTGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTTGGTCCCAGGGCTGCGCGAGGTGGG 
TGGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGCACCAGACCGACGACTACAAGACGG 
CAGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCACGTCTACCTCCCCATGCGCGCAGCC 
CCCAAGGACGGCTACTGGCCGGCGGGCGAGCTGAAAGAGGGCGATGCCTCGACCGGGAAATGGCAGGAGCTGACCCCATC 
CCTGAGCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCGACGGGGAGCACGCCTGGGCGCTCT 
GGCGTCCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGACTTCCAATAAGGACACGGAGACGAA 
TCATGCGAATGAACATCACCTCGGTCGCGCTAATGTGGCTGCTCGCAGCGCAACTTGCCCAGGCCGACGACCCGATCAAC 
GTGTCCAAGACCGGCACGGTGCTCAGCGACGAGGTCCTCTACAGCATTGGCGGCGGCAGTGCGGTGAGCATGGGCAGCGC 
CGGCCAGATGGACTCGATCGGCGTCGGCTTCGGCTGGAACAACGACATGATGTGCGGAAACATGAACCTGAGCACCACCC

TGGAGAACCAGCTCAACGGTGCCACACAGGGTTTCCAGAACATCATGGGCTCAGTCATCCAGAACGCGACCGGCGCGGTC
ATGTCGCTGCCGGCGTTGATCATCCAGCGCGCGAACCCTCAGCTCTACAACCTGATCACCAATGGCATCCTGCAGGCGCG 
GATCGACTACGACCGCTCGAAAGGGACTTGCAAAACGATCGCCGAAAAGATGGCTGACATCGCTGGCGAGCAGACCGGCT 
GGGGGAAAATCGCCGAAGGCCAAGCCCTGGGCGCCACACTGGCCTCTGACGGGAAAGACGCCGTATCCGCCCTCGAAGCA 
GTGGAGAAGAAAGGCGGCAACGATGGCGTAACCTGGGTTGGTGGAGACAAGGCCGGCGGCTCCGGCCAGAAGCCCATTCG 
CATCGTCAACGACGTGACCCGGGCGGGCTACAACCTGTTGACCAGCCGCTCGGTGAATGATTCGTCGAGCGTGCCTTCCG 
CCACTTGCAACAACGGCCTGGTCTGCAACACTTGGTCCTCCCCCCAGGAGGCCGCCGCATTCGCCACCCGGGTACTGGGG 
GAGCAACAGCAACAGACCTGCGAAGGCTGCCAGAAGACGGTGACGGCTGCTGGCGTCGGCCTCACCCCGCTGATCCAGGA 
GACCTACGACAAGAAGCTCCAGTCGCTGCAGGAGCTGCTGTCGAAGAGCAAACCACTGACTGCAGAGAACCTGGCTGCGG 
CCGGCACCGATGCTCTGCCAATTACCCGCGGCGTCATCGAGGCGCTGCGCGACGAGCGTGACCAGGACGTCCTGGCGCGC 
CGCCTGGCGTCCGATGTCTCCCTGATGGACGTGCTCAGCAAGGCACTGCTACTGCAGCGCCTGATGTTCGCCGGCGCCAA 
GGAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTCCTGCAGCAGGAGATCTCCA 
ATCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCATCGAGCGCGGGCAACAACGC 
GCCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGCAGGCCCCCTCTGCCGCCGG 
CGGCAAGTCGGGAGGGAGACCGTGA 

>ORF5899 (SEQ ID NO:52) 

TCGGCCATCCTGGTGGCGCCACGTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCGCTG 
GTGCCGTACTTTCTCAGCACACTGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTTGGT 
CCCAGGGCTGCGCGAGGTGGGTGGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGCACC 
AGACCGACGACTACAAGACGGCAGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCACGTC 
TACCTCCCCATGCGCGCAGCCCCCAAGGACGGCTACTGGCCGGCGGGCGAGCTGA 

>ORF6325 (SEQ ID NO:54) 

GCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCGACGGGGAGCACGCCTGGGCGCTCTGGCGT 
CCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGACTTCCAATAAGGACACGGAGACGAATCATG 
CGAATGAACATCACCTCGGTCGCGCTAATGTGGCTGCTCGCAGCGCAACTTGCCCAGGCCGACGACCCGATCAACGTGTC 
CAAGACCGGCACGGTGCTCAGCGACGAGGTCCTCTACAGCATTGGCGGCGGCAGTGCGGTGAGCATGGGCAGCGCCGGCC 
AGATGGACTCGATCGGCGTCGGCTTCGGCTGGAACAACGACATGATGTGCGGAAACATGAACCTGAGCACCACCCTGGAG 
AACCAGCTCAACGGTGCCACACAGGGTTTCCAGAACATCATGGGCTCAGTCATCCAGAACGCGACCGGCGCGGTCATGTC 
GCTGCCGGCGTTGATCATCCAGCGCGCGAACCCTCAGCTCTACAACCTGATCACCAATGGCATCCTGCAGGCGCGGATCG 
ACTACGACCGCTCGAAAGGGACTTGCAAAACGATCGCCGAAAAGATGGCTGA 
>ORF7567c (SEQ ID NO:56)

cagtgccttgctgagcacgtccatcagggagacatcggacgccaggcggcgcgccaggacgtcctggtcacgctcgtcgc
gcagcgcctcgatgacgccgcgggtaattggcagagcatcggtgccggccgcagccaggttctctgcagtcagtggtttg 
ctcttcgacagcagctcctgcagcgactggagcttcttgtcgtaggtctcctggatcagcggggtgaggccgacgccagc 
agccgtcaccgtcttctggcagccttcgcaggtctgttgctgttgctcccccagtacccgggtggcgaatgcggcggcct

cctggggggaggaccaagtgttgcagaccaggccgttgttgcaagtggcggaaggcacgctcgacgaatcattcaccgag

cggctggtcaacaggttgtagcccgcccgggtcacgtcgttgacgatgcgaatgggcttctggccggagccgccggcctt 

gtctccaccaacccaggttacgccatcgttgccgcctttcttctccactgcttcgagggcggatacggcgtctttcccgt 

cagaggccagtgtggcgcccagggcttggccttcggcgattttcccccagccggtctgctcgccagcgatgtcagccatc 

ttttcggcgatcgttttgcaagtccctttcgagcggtcgtagtcgatccgcgcctgcaggatgccattggtgatcaggtt 

gtagagctgagggttcgcgcgctggatgatcaacgccggcagcgacatgaccgcgccggtcgcgttctggatgactga 

>ORF7180 (SEQ ID NO:58)

ttcgtcgagcgt(mtccgccacttgcaacaac(&c™ 

tcgccacccgggtactgggggagcaacagcaacagacctgcgaaggctgccagaagacggtgacggctgctggcgtcggc 
ctcaccccgctgatccaggagacctacgacaagaagctccagtcgctgcaggagctgctgtcgaagagcaaaccactgac 
tgcagagaacctggctgcggccggcaccgatgctctgccaattacccgcggcgtcatcgaggcgctgcgcgacgagcgtg

A 

>ORF7501 (SEQ ID NO:60) 

CCAGGACGTCCTGGCGCGCCGCCTGGCGTCCGATGTCTCCCTGATGGACGTGCTCAGCAAGGCACTGCTACTGCAGCGCC 
TGATGTTCGCCGGCGCCAAGGAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTC 
CTGCAGCAGGAGATCTCCAATCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCAT 
CGAGCGCGGGCAACAACGCGCCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGC 
AGGCCCCCTCTGCCGCCGGCGGCAAGTCGGGAGGGAGACCGTGATGGCAGATACGCTCACCACCCGAAAGCTTCTCGGTC 
AGCTACTGGTCGGAGTGCTGATCGTCATCGGACTGGCAGTGGTCGGTACGCTGCTCAGTCTCTTCGCCCTGAACCACTTC 
GGTGGCATCCAGGGCCTGGAGGCCTGGCGGCAAAGCAACTACTGGAGCTTGTTCGCCTGGCGGGCGCTGCTGTACTGCGC 
CCTGGCCATCGCCTGGTTCCGGCAGCGCAAGGAACTGAGCGCGCATGAGCGGCAGCGCATTCGGCGGATCGAGATCCTGG 
TGCTGTTGCT(^CTGCTCATCGAATTCAGCAM^^ 

>ORF7584 (SEQ ID NO:62) 

TGTTCGCCGGCGCCAAGGAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTCCTG

CAGCAGGAGATCTCCAATCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCATCGA

GCGCGGGCAACAACGCGCCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGCAGG 

CCCCCTCTGCCGCCGGCGGCAAGTCGGGAGGGAGACCGTGATGGCAGATACGCTCACCACCCGAAAGCTTCTCGGTCAGC 

TACTGGTCGGAGTGCTGATCGTCATCGGACTGGCAGTGGTCGGTACGCTGCTCAGTCTCTTCGCCCTGAACCACTTCGGT 

GGCATCCAGGGCCTGGAGGCCTGGCGGCAAAGCAACTACTGGAGCTTGTTCGCCTGGCGGGCGCTGCTGTACTGCGCCCT 

GGCCATCGCCTGGTTCCGGCAGCGCAAGGAACTGAGCGCGCATGA 

>ORF8208c (SEQ ID NO:64) 

AGGTCATGCTGCGCCTCCCGTGCGGAAGTAGGCTTTGCTGAATTCGATGAGCAGGACCAGCAACAGCACCAGGATCTCGA 
TCCGCCGAATGCGCTGCCGCTCATGCGCGCTCAGTTCCTTGCGCTGCCGGAACCAGGCGATGGCCAGGGCGCAGTACAGC 
AGCGCCCGCCAGGCGAACAAGCTCCAGTAGTTGCTTTGCCGCCAGGCCTCCAGGCCCTGGATGCCACCGAAGTGGTTCAG 
GGCGAAGAGACTGAGCAGCGTACCGACCACTGCCAGTCCGATGACGATCAGCACTCCGACCAGTAG 

>ORF8109 (SEQ ID NO:66)

GCGGCAGCGCATTCGGCGGATCGAGATCCTGGTGCTGTTGCTGGTCCTGCTCATCGAATTCAGCAAAGCCTACTTCCGCA
CGGGAGGCGCAGCATGACCTTCATGACCAATGACTACCTGGAGTATTACCTCACCCTCCTCGGCTGGATCATCAACAACG 
GGATCTGGAACATGATCTCGGATACTGGCCTGTTCGCGGTGCCGTTCGCGGCCATCGTGATGCGCGAATGGCTGAAAGTT 
CGTGGGGAAGGCGCCGACGAGGGCAACAAGGGAGTGCTGTCTCTCGCCCGCATCGAGACGCATATCTACGTCGGCTACAT 
CGTGGTCGCCCTGGCGGGGATCCCGGTCGTCAACGTGAGCTTCGACACCATCGAGTTCGACCAGACTCGCGCCCAGCAGT 
GCCAATACAATCTGCCGGCACCGGCGGACACCGGCTGGTCGAGCTCCTTCAGCAGCCTGGCCGGCAAGAGTGCGCAGATG 
CCGCTCTGGTGGGCGATGATGCACGCCCTGTCCAAGGGCTTCACCAGCGGCGCCATCGCGGCCATTCCGTGCGGCACGGA 
TCTGCGGCAGATGCGAATGGAAGTGGACAACACGCGCGTGAACAATCCGCTGCTGGCACAAGAAATCGCTGATTTTTCCA 
GAGACTGCTACGGGCCTTCCCGTGCGCGGCTGTTCATGCGGCAACCCGACCTGGGCTCCGTCGCCGAGGACAACAAGGCG 
TTGCAAGACCTGAACTGGATCGGCTCCCGATTCTTGTTGAACACCCCGGGGTACTACGACACCGACTACTCGAAGAGTCC 
CCGTCAGTCGTGGCCCTACAACGCCACCCGCGATGCCGGCCTGCCTCAGGTGGGCGGTGGTGGCGGCTACCCAACCTGCA 
AGCAGTGGTGGGCTGACTCAGGGATCGGCTTGCGTGATCGGATCAAGGACCAGGTGGATCCGGACCTGATGACCAGCTTC 
CTCAAGTGGGCGAAATGGTTGAACCAGGACGAGGTGACCGAGGCTGTCATTCGCCAGGTGATCTCACCCTCCAGCCAGGT 
CAAGGGTAACGTCTACACCGATTACGGCGGGCAGGTGGGCGGCACCGTGTGGAACGGCATCGCGAGAACCGCAGGAACCT 
TCGGCGTTGCGGTGGGCAGCTTGGCATACTTCCCGGCGATGGATATGGTCCGCCAGGCACTGCCGATGGTGATGTCGTTC 
CTGAAGATGGCAATGGTCATCTGCATTCCGATGGTCCTGGTCATCGGCACCTATCAACTGAAAGTTGCCATGACGATGAC 
GGTCGTCTTCTTTGCGATGATGTTCGTCGACTTCTGGTTTCAGTTAGCCAGATATATCGACAGCACGATACTTGATGCTT 
TCTATGGTTCGGGATCACCACATCTTTCATTCAACCCAGTCATGGGGCTGAATACGGCTACTCAAGATGCGATCTTGAAC 
TTCGTTATGGGTTCTATGTTCATTGTTTTACCACTACTGTGGATGACAGCGATCGGCTGGTCCGGAATTCAAGCAGGGTC 
TGTTCTGAACGGATTGAGCAGAGGGACTGAAGGAGTTCAAGCCGCCGGCAAGGAAGCAGGAAATAGAGTTAAAAACGCAG 

TTTGA 

>ORF9005c (SEQ ID NO:68) 

GTCAGCCCACCACTGCTTGCAGGTTGGGTAGCCGCCACCACCGCCCACCTGAGGCAGGCCGGCATCGCGGGTGGCGTTGT 
AGGGCCACGACTGACGGGGACTCTTCGAGTAGTCGGTGTCGTAGTACCCCGGGGTGTTCAACAAGAATCGGGAGCCGATC 
CAGTTCAGGTCTTGCAACGCCTTGTTGTCCTCGGCGACGGAGCCCAGGTCGGGTTGCCGCATGAACAGCCGCGCACGGGA 
AGGCCCGTAGCAGTCTCTGGAAAAATCAGCGATTTCTTGTGCCAGCAGCGGATTGTTCACGCGCGTGTTGTCCACTTCCA 
TTCGCATCTGCCGCAGATCCGTGCCGCACGGAATGGCCGCGATGGCGCCGCTGGTGAAGCCCTTGGACAGGGCGTGCATC 
ATCGCCCACCAGAGCGGCATCTGCGCACTCTTGCCGGCCAGGCTGCTGAAGGAGCTCGACCAGCCGGTGTCCGCCGGTGC 
CGGCAGATTGTATTGGCACTGCTGGGCGCGAGTCTGGTCGAACTCGATGGTGTCGAAGCTCACGTTGACGACCGGGATCC 
CCGCCAGGGCGACCACGATGTAGCCGACGTAGATATGCGTCTCGATGCGGGCGAGAGACAGCACTCCCTTGTTGCCCTCG 
TCGGCGCCTTCCCCACGAACTTTCAGCCATTCGCGCATCACGATGGCCGCGAACGGCACCGCGAACAGGCCAGTATCCGA 
GATCATGTTCCAGATCCCGTTGTTGATGATCCAGCCGAGGAGGGTGAGGTAATACTCCAGGTAGTCATTGGTCATGAAGG 

TCATGCTGCGCCTCCCGTGCGGAAGTAG 

>ORF8222 (SEQ ID NO:70)

CTACCTGGAGTATTACCTCACCCTCCTCGGCTGGATCATCAACAACGGGATCTGGAACATGATCTCGGATACTGGCCTGT 
TCGCGGTGCCGTTCGCGGCCATCGTGATGCGCGAATGGCTGAAAGTTCGTGGGGAAGGCGCCGACGAGGGCAACAAGGGA 
GTGCTGTCTCTCGCCCGCATCGAGACGCATATCTACGTCGGCTACATCGTGGTCGCCCTGGCGGGGATCCCGGTCGTCAA 
CGTGAGCTTCGACACCATCGAGTTCGACCAGACTCGCGCCCAGCAGTGCCAATACAATCTGCCGGCACCGGCGGACACCG 
GCTGGTCGAGCTCCTTCAGCAGCCTGGCCGGCAAGAGTGCGCAGATGCCGCTCTGGTGGGCGATGATGCACGCCCTGTCC 
AAGGGCTTCACCAGCGGCGCCATCGCGGCCATTCCGTGCGGCACGGATCTGCGGCAGATGCGAATGGAAGTGGACAACAC 
GCGCGTGAACAATCCGCTGCTGGCACAAGAAATCGCTGA 

>ORF8755c (SEQ ID NO:72) 

CAGTCTCTGGAAAAATCAGCGATTTCTTGTGCCAGCAGCGGATTGTTCACGCGCGTGTTGTCCACTTCCATTCGCATCTG 
CCGCAGATCCGTGCCGCACGGAATGGCCGCGATGGCGCCGCTGGTGAAGCCCTTGGACAGGGCGTGCATCATCGCCCACC 
AGAGCGGCATCTGCGCACTCTTGCCGGCCAGGCTGCTGAAGGAGCTCGACCAGCCGGTGTCCGCCGGTGCCGGCAGATTG 
TATTGGCACTGCTGGGCGCGAGTCTGGTCGAACTCGATGGTGTCGAAGCTCACGTTGACGACCGGGATCCCCGCCAGGGC 

GACCACGATGTAG 

>ORF9431c (SEQ ID NO:74)

CTGAAACCAGAAGTCGACGAACATCATCGCAAAGAAGACGACCGTCATCGTCATGGCAACTTTCAGTTGATAGGTGCCGA
TGACCAGGACCATCGGAATGCAGATGACCATTGCCATCTTCAGGAACGACATCACCATCGGCAGTGCCTGGCGGACCATA 
TCCATCGCCGGGAAGTATGCCAAGCTGCCCACCGCAACGCCGAAGGTTCCTGCGGTTCTCGCGATGCCGTTCCACACGGT 
GCCGCCCACCTGCCCGCCGTAATCGGTGTAGACGTTACCCTTGACCTGGCTGGAGGGTGA 

>ORF9158 (SEQ ID NO:76) 

CGTCTACACCGATTACGGCGGGCAGGTGGGCGGCACCGTGTGGAACGGCATCGCGAGAACCGCAGGAACCTTCGGCGTTG
CGGTGGGCAGCTTGGCATACTTCCCGGCGATGGATATGGTCCGCCAGGCACTGCCGATGGTGATGTCGTTCCTGAAGATG 
GCAATGGTCATCTGCATTCCGATGGTCCTGGTCATCGGCACCTATCAACTGAAAGTTGCCATGACGATGACGGTCGTCTT 
CTTTGCGATGATGTTCGTCGACTTCTGGTTTCAGTTAGCCAGATATATCGACAGCACGATACTTGA 

>ORF10125c (SEQ ID NO:78) 

GTGATAGCAGGATGCCTCCCTTTGGGAGCCAGGAGATTGATGATGAACGCGCACACCAACAAAGGCTTTGCCTCCCGGAT 
CGGTTTTGGTCTGGGTATGCTTGTGCGTTTCTGCCTGCATGATCGCCGTCCAGCTCTACGTTGGGTTAAGCGAGTTAGCC 
TATTCTTGTTAGTAGCTCTTGTAGTGTCACAGAATTTTATGTGGCTTGCTGGGGTATCAATGACTCTACTGTGTGTCTTT 
CTGGTGGGATTTGCCTTGGTTAAAGGGGACATCTCCGTCTCTAAAGGGTCTCCAAGTCGAGATGTCTCAACTATGACTTC 
ACAAGCTGAAACTGAATCTGTAGCAGAGCTGTTTGACTATCAGGCAGCACACCATTACCGGGACTAG 

>ORF9770 (SEQ ID NO:80) 

TCAAACAGCTCTGCTACAGATTCAGTTTCAGCTTGTGAAGTCATAGTTGAGACATCTCGACTTGGAGACCCTTTAGAGAC 

GGAGATGTCCCCTTTAACCAAGGCAAATCCCACCAGAAAGACACACAGTAGAGTCATTGATACCCCAGCAAGCCACATAA 

AATTCTGTGACACTACAAGAGCTACTAACAAGAATAGGCTAACTCGCTTAACCCAACGTAGAGCTGGACGGCGATCATGC 

AGGCAGAAACGCACAAGCATACCCAGACCAAAACCGATCCGGGAGGCAAAGCCTTTGTTGGTGTGCGCGTTCATCATCAA 

TCTCCTGGCTCCCAAAGGGAGGCATCCTGCTATCACCTATACGCCGAAAAAGATGATTTGGCAAGCATTATGGCATATTA 

TGCCACTAGCTATCTGCCGACTGGAGTACCTCATGGCAACGCGAAACGTCGTCCTTCCCGATCCGCTGGAGCAGGATATC 

AACGAGCTGGTGGAGACCGGCCGCTATCAGAATCGCAGCGAAGTCATCCGGGCAGGCTTGCGCCTGCTGCTGCAACAGGA

AGCCCAGATANGCGCCAAGCTCGAAACCCTCCGCAACGCAACATCCAGTGGGCTGATGCAACTGGAGCGCGGCGAGTACG 

ACGAGATCACCAGCGACGAACTGGCCCAATACCTCGACGAGCTCGGCAACCAGGCGAGCCACTGA 

>ORF9991 (SEQ ID NO:82) 

AGCTGGACGGCGATCATGCAGGCAGAAACGCACAAGCATACCCAGACCAAAACCGATCCGGGAGGCAAAGCCTTTGTTGG 
TGTGCGCGTTCATCATCAATCTCCTGGCTCCCAAAGGGAGGCATCCTGCTATCACCTATACGCCGAAAAAGATGATTTGG 
CAAGCATTATGGCATATTATGCCACTAGCTATCTGCCGACTGGAGTACCTCATGGCAACGCGAAACGTCGTCCTTCCCGA 
TCCGCTGGAGCAGGATATCAACGAGCTGGTGGAGACCGGCCGCTATCAGAATCGCAGCGAAGTCATCCGGGCAGGCTTGC 
GCCTGCTGCTGCAACAGGAAGCCCAGATANGCGCCAAGCTCGAAACCCTCCGCAACGCAACATCCAGTGGGCTGATGCAA
CTGGAGCGCGGCGAGTACGACGAGATCACCAGCGACGAACTGGCCCAATACCTCGACGAGCTCGGCAACCAGGCGAGCCA 
CTGAAGCATGGCCAAGTACCGCATCTCTCATGA 

>ORF10765c (SEQ ID NO:84) 

CACCTGGTCTGTCGCCACCCGGTAGAAGACGAAGTGCCTGGGCCGAACAACCTTACCGACATTGGGCATCGAGTGGCAGT 
AAACGAGGTGGATGCTGCGCAGGCCAGCTCCCAGTTCTTCACGGCTGATGCTGCCTACCTGTTGTGGGTCTGTCGCAACT 
GCTTCCAGCGCCGCCCCTATGAGTGCCTGGTAACGTCGGCGCGCGGCATCGCCGAAGTGGTTGTGGGTGAAGCGCAGGAT 
ATCGACGATGTCCGCTTGGGCATCATGAGAGATGCGGTACTTGGCCATGCTTCAGTGGCTCGCCTGGTTGCCGAGCTCGT 
CGAGGTATTGGGCCAGTTCGTCGCTGGTGATCTCGTCGTACTCGCCGCGCTCCAGTTGCATCAGCCCACTGGATGTTGCG 
TTGCGGAGGGTTTCGAGCTTGGCGCNTATCTGGGCTTCCTGTTGCAGCAGCAGGCGCAAGCCTGCCCGGATGACTTCGCT 
GCGATTCTGATAGCGGCCGGTCTCCACCAGCTCGTTGATATCCTGCTCCAGCGGATCGGGAAGGACGACGTTTCGCGTTG 
CCATGAGGTACTCCAGTCGGCAGATAGCTAG 

>ORF10475 (SEQ ID NO:86)

AGCATGGCCAAGTACCGCATCTCTCATGATGCCCAAGCGGACATCGTCGATATCCTGCGCTTCACCCACAACCACTTCGG

CGATGCCGCGCGCCGACGTTACCAGGCACTCATAGGGGCGGCGCTGGAAGCAGTTGCGACAGACCCACAACAGGTAGGCA

GCATCAGCCGTGAAGAACTGGGAGCTGGCCTGCGCAGCATCCACCTCGTTTACTGCCACTCGATGCCCAATGTCGGTAAG

GTTGTTCGGCCCAGGCACTTCGTCTTCTACCGGGTGGCGACAGACCAGGTGCTAGAGGTGGTTCGCGTGCTTCACGACGC

CATGGATGTGGATCAACACCTGCCCCAACGATGA

>ORF11095c (SEQ ID NO:88) 

AGCCGCATGCAAGCGGTGGTCAGCACGAATGCAAATGCTTGGTCAGGGGGAATGCAATCGAGTGGTCAAGCCACTGCTAT 
TGCGCATCAACCATGGGGCACCTGCTGGTGGATGTTCACCCGTAGCCTTTTCGTGTTCGCCGGCGCGAACGCAGCCCTTT 
CTGCCTTCCGGCAGGCCCTTTCGGGTAGGGCTTTTACCCTTGTGAACCATTCCCTTCGCCCTTCAAGCCCATTTCCCCTT 
TGGGCCATTTGCTCCTGTTACAGTTGCTCATCGTTGGGGCAGGTGTTGATCCACATCCATGGCGTCGTGAAGCACGCGAA 
CCACCTCTAG 

>ORF11264 (SEQ ID NO:90)

ACCGCGGTGCGGAGAGATCTCCTCAAACTGATGGGTTGCACGCATATCGAAGCAGATTACATAGGAGGCTTGCGCTGTTC 
AACAGCTCCTGAGGGGACTTGGGTTGCCCATGGTTTCCACGGCCCAATCGTTGACGTCATTGACGATTCCGCTGGCTTTT 
TCAGTACGCATCGCTTGGCGCTCCATTACCCAGCCCAATGCGGCCTTGCCGTTGACCAAGCGATTCCAAGGACTGCGATC 
CATGTAGCCAGCCCTCTAATGCATGTATGTATAGGTAAGGTCGTCGTTATTTCGGCGTGGATGTGCTGA 

>ORF11738 (SEQ ID NO:92) 

GAAGAGGTGATCATGAAGTTACAGGCATATCGGCTGCAGAACTACCGCCGGCTGCGCGATGTTGTCATCGAGCTCGATGA 
CGAAATTTCTATCTTTGTCGGTGCCAACAACAGCGGGAAGACATCCGCCGTCCAAGGCCTGTACTCAATGCTTCGCGGCG

AAGTGAAGAAGTTCGAGCTCTTTGACTTCAGTGCGGCGCTGTGGGCCGAGATCGATGCGGTCGGCAGGACGCCCCCTGGC 
GATGAGGATGCGCCCAAAAGGTTACCGTCCATACTCTTGGATCTCTGGTTCCGCGTCGGTGAAGACGACCTCGCCACTGC 
GATGTCGCTGCTGCCGAGCACTGAGTGGGACGGCAAGTGCGTCGGGATCCGGGTAGCGTTCGAGCCTCGGGATGCCCACG 
AGCTCGTCTGGAAGTTCCATGAACTACATGAGAAGGCCAACAACGCAGCTGTCGCGCTTGCGGCCAAGCGCAAGGCCGCC
GGGGAGCAAGCTGTGGAGGCGGGCGCGGAAGACGCGGCTGCGGTGGTGGCCGATGCCGGCGAGTACAAGCCTTGGCCAGA

AAGCCTGACGAAGTACCTCACAAAGGAACTGAGCAAGGAATACACCTTCCGCTACTACGTGCTCGATGAGCGGGCTTTTG 
TCGGCTATCAGGCAAGGGAGGCCGACTACGAGCCGCTACCCCTAGGCAAGGAGCCGGGCGGTGCAGCCATTCTCAAGTCG 
CTGGTGAGGGTCGACTTCCTGCGCGCGCAGCGGCACCTCGATGACCCAGATGCCGGTAGCTCTGATCGCGCAGAGAGCTT 
GTCGCGGCGTCTGAGCAGGTTCTATCACCGCAACCTGGAGAAGCGTGGCGACGACCATGCGGCTCTCAAGGCGCTAGATA 
CCTCGGAGAAGGAGCTGAACTTCCACCTGAAGGAAGTCTTCAATGACACCCTCACGCGCCTGGCCAAGCTCGGCTATCCG 
GGCGTCAACAATCCGGAGATCGTGATTCGGGCGGCCTTGGATCCGACCACTGTCTTGGGGCAAGACGCCAAGGTTCACTA 
CGTGATCCCGGGCGTAGCTTCCGCCCAACTGCCAGACAGCTACAATGGCCTGGGGTTCAAGAATCTGGTCTACATGGTGG 
TTGAGCTGCTCGACTTGCACGAGCAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAG 
CCTGAGGCGCATCTGCACGCGCAGATCCAGCAGGTCTTCATCAGGAACGTTTTGCGCCTCCTTGAGGATGCTAACGATCA 
CGCGACTTTGTTCCACACGCAGCTCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTC 
GGTACTTCCGCCGCGTCAACGACCAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCC 
GACGCTCCAGCGCGCGAATTCCTGCAGCGGTATCTGAAGCTGACGCACTGCGATCTCTTTTTTTCCGACGCGGTGATATT 
GGTGGAAGGCAACGTCGAGCGTCTGCTCCTGCCTGCAATGATCGAGTTGGTGGCCAAGCGCCTGCGTTCTTCCGCCCTAA 
CCATCCTTGAAGTCGGTGGTGCGTTCGCGCATCGGTTCCAGGAGCTGATCGCCTTCGTTGGGCTCACAACACTGGTCATC 
ACGGATCTGGACAGCGTGACGGTCAAGACGGACGCCGAGAAGGCCGCCGCGCAAGGCGCAGGCGCTGAGGGCGCCGTTGA

CGGAGATGACGAGGACGAGGACGACGACCTGAAGCCCTTCGAGCTTGAAGACGACGACGAAGCAGAACCGAGTGGCAAGA 
AGAAGTCCAAGAAGCGTGGCAGCACCTGCCATGCACACGTGGAAGGTGCCGTCACGTCCAACCAAACCCTCATCAGCTGG 
ATCCCGAAGAAGCGGTCGATGGCAGAGCTCTGGGAAGTCACGGCGGAGCAAAAGACGCTGTCGCTGGCTGAGGATTCCAG 
CGCTGGGGTTCGGGTAGCTTACCAGACCAAGGTTTCGGTGACGGTGGGTGCGACGACATCACAGCTCTGCGGCCGCACAC 
TTGAGGAGGCCTTTGGTCTTGAGAACGCGGACTGGTGCCAGGCTGAGGCAAACCGGTCGGTCGGCCTCAAGCTCAAGCGC 
GCACCGAGCAGCCCTGAAGAGCTGGCTGAGAAGTTACACGATAGGGTGGTCGGCAAGAACTTCGACAAGACCCGCTTTGC 
GCTGGAGGTACTCGCAAGCGGGCCGCTCAATGGCTGGAAGGTTCCCGCGTACATCGCCGAGGGCTTGGCCTGGCTCGAAG 
CCAAAGTGGCCCACGAGCTTGAGGCGGATGCTGCCATCGCCACCGAGGTCGCGACTATTGAGCCGACTACAGCCGATGTT 
GTCGCTATCATTGTTGACCCGGGGCAGACGGCATGA 

>ORF12348c (SEQ ID NO:94)

cggaaggtgtattccttgctcagttcctttgtgaggtacttcgtcaggctttctggccaaggcttgtactcgccggcatc
ggccaccaccgcagccgcgtcttccgcgcccgcctccacagcttgctccccggcggccttgcgcttggccgcaagcgcga
cagctgcgttgttggccttctcatgtagttcatggaacttccagacgagctcgtgggcatcccgaggctcgaacgctacc
cggatcccgacgcacttgccgtcccactcagtgctcggcagcagcgacatcgcagtggcgaggtcgtcttcaccgacgcg 
gaaccagagatccaagagtatggacggtaaccttttgggcgcatcctcatcgccagggggcgtcctgccgaccgcatcga 
tctcggcccacagcgccgcactgaagtcaaagagctcgaacttcttcacttcgccgcgaagcattgagtacaggccttgg 
acggcggatgtcttcccgctgttgttggcaccgacaaagatagaaatttcgtcatcgagctcgatgacaacatcgcgcag 
ccggcggtag 

>ORF12314c (SEQ ID NO:96)

GGTACTTCGTCAGGCTTTCTGGCCAAGGCTTGTACTCGCCGGCATCGGCCACCACCGCAGCCGCGTCTTCCGCGCCCGCC 
TCCACAGCTTGCTCCCCGGCGGCCTTGCGCTTGGCCGCAAGCGCGACAGCTGCGTTGTTGGCCTTCTCATGTAGTTCATG 
GAACTTCCAGACGAGCTCGTGGGCATCCCGAGGCTCGAACGCTACCCGGATCCCGACGCACTTGCCGTCCCACTCAGTGC 
TCGGCAGCAGCGACATCGCAGTGGCGAGGTCGTCTTCACCGACGCGGAACCAGAGATCCAAGAGTATGGACGGTAACCTT 
TTGGGCGCATCCTCATCGCCAGGGGGCGTCCTGCCGACCGCATCGATCTCGGCCCACAGCGCCGCACTGA 

>ORF13156c (SEQ ID NO:98) 

CGACAGATTGCGCACATCCGTGTGATGGCCCAACTGGTCGTTGACGCGGCGGAAGTACCGAATGGGCGAGAATCCGCGTT 
CATAGAGGATGTGCGGGGAGTGCGTGGTGATGACGAGCTGCGTGTGGAACAAAGTCGCGTGATCGTTAGCATCCTCAAGG 
AGGCGCAAAACGTTCCTGATGAAGACCTGCTGGATCTGCGCGTGCAGATGCGCCTCAGGCTCCTCAATGAAGACCAAATG 
AAGCGGAGCTCGCTTGTCATCCTCGGCTTTCCACTGCTCGTGCAAGTCGAGCAGCTCAACCACCATGTAGACCAGATTCT 
TGAACCCCAGGCCATTGTAGCTGTCTGGCAGTTGGGCGGAAGCTACGCCCGGGATCACGTAGTGAACCTTGGCGTCTTGC 
CCCAAGACAGTGGTCGGATCCAAGGCCGCCCGAATCACGATCTCCGGATTGTTGACGCCCGGATAGCCGAGCTTGGCCAG 
GCGCGTGAGGGTGTCATTGAAGACTTCCTTCAGGTGGAAGTTCAGCTCCTTCTCCGAGGTATCTAG 

>ORF12795 (SEQ ID NO:100)

CTTCCGCCCAACTGCCAGACAGCTACAATGGCCTGGGGTTCAAGAATCTGGTCTACATGGTGGTTGAGCTGCTCGACTTG
CACGAGCAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAGCCTGAGGCGCATCTGCA 
CGCGCAGATCCAGCAGGTCTTCATCAGGAACGTTTTGCGCCTCCTTGAGGATGCTAACGATCACGCGACTTTGTTCCACA 
CGCAGCTCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTCGGTACTTCCGCCGCGTC 
AACGACCAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCCGACGCTCCAGCGCGCGA 

ATTCCTGCAGCGGTATCTGA 
>ORF13755c (SEQ ID NO:210) 

GCTACCCGAACCCCAGCGCTGGAATCCTCAGCCAGCGACAGCGTCTTTTGCTCCGCCGTGACTTCCCAGAGCTCTGCCAT 
CGACCGCTTCTTCGGGATCCAGCTGATGAGGGTTTGGTTGGACGTGACGGCACCTTCCACGTGTGCATGGCAGGTGCTGC 
CACGCTTCTTGGACTTCTTCTTGCCACTCGGTTCTGCTTCGTCGTCGTCTTCAAGCTCGAAGGGCTTCAGGTCGTCGTCC 
TCGTCCTCGTCATCTCCGTCAACGGCGCCCTCAGCGCCTGCGCCTTGCGCGGCGGCCTTCTCGGCGTCCGTCTTGACCGT 
CACGCTGTCCAGATCCGTGATGACCAGTGTTGTGAGCCCAACGAAGGCGATCAGCTCCTGGAACCGATGCGCGAACGCAC 
CACCGACTTCAAGGATGGTTAGGGCGGAAGAACGCAGGCGCTTGGCCACCAACTCGATCATTGCAGGCAGGAGCAGACGC 
TCGACGTTGCCTTCCACCAATATCACCGCGTCGGAAAAAAAGAGATCGCAGTGCGTCAGCTTCAGATACCGCTGCAGGAA 
TTCGCGCGCTGGAGCGTCGGACGCGCCCGTTTTGAATAGCGACAGATTGCGCACATCCGTGTGA 

>ORF13795c (SEQ ID NO:212) 

TGTCGTCGCACCCACCGTCACCGAAACCTTGGTCTGGTAAGCTACCCGAACCCCAGCGCTGGAATCCTCAGCCAGCGACA 
GCGTCTTTTGCTCCGCCGTGACTTCCCAGAGCTCTGCCATCGACCGCTTCTTCGGGATCCAGCTGATGAGGGTTTGGTTG 
GACGTGACGGCACCTTCCACGTGTGCATGGCAGGTGCTGCCACGCTTCTTGGACTTCTTCTTGCCACTCGGTTCTGCTTC 
GTCGTCGTCTTCAAGCTCGAAGGGCTTCAGGTCGTCGTCCTCGTCCTCGTCATCTCCGTCAACGGCGCCCTCAGCGCCTG 
CGCCTTGCGCGGCGGCCTTCTCGGCGTCCGTCTTGACCGTCACGCTGTCCAGATCCGTGATGACCAGTGTTGTGAGCCCA 
ACGAAGGCGATCAGCTCCTGGAACCGATGCGCGAACGCACCACCGACTTCAAGGATGGTTAG 

>ORF14727c (SEQ ID NO:214) 

CAGGAAGTCGGCGAGCTGAAGGATGTCCTCGTGGCCAAGTATGCCCTTGGCGTAGTCACTGCCCACGCCGTAGTTGAACG 
TCCTGACGCCGGCCACAGCCTCCAGGCTTCGGACATATCGCTCTTGGTCGGCCTTGTTCCTGTCGCGCGTGGTCTGCCGG 
ACACGCGAGCTGTAATTCTCGAACTCTTCTTCAAGTTCGGAGATCCGCCTGCGGATGTCGTTCTGCAGCCAAACCTTGAT 
GTCGGCCTGGAACGTCTTTGCAATAGACCAGTAAAAGCTGTGGATGGTCGAGACATGAACCAGCGGGTCATCGTTGACGT 
CCGCCAGGATTTCATTGGTGGCAAGGTCGGTATACGTGATGCACGCGACTATCTGCTTCCTCGCCCGCATGCTGGCGCCG 
TGCTCCGAGATCACCCAGTCCAGCGCCTTGATGAGGGAGGTGGTCTTGCCGGAACCTGCGCCAGCACGAACCACGAAGGG 
CTGCGGAGGCGTCGCTACAATGCATGCGTGGATCTCGCGGTCGGCGTCGGTATCTGGGCTATCAATTCGTCTGCTCATGC 
CGTCTGCCCCGGGTCAACAATGATAGCGACAACATCGGCTGTAGTCGGCTCAATAGTCGCGACCTCGGTGGCGATGGCAG 
CATCCGCCTCAAGCTCGTGGGCCACTTTGGCTTCGAGCCAGGCCAAGCCCTCGGCGATGTACGCGGGAACCTTCCAGCCA 
TTGAGCGGCCCGCTTGCGAGTACCTCCAGCGCAAAGCGGGTCTTGTCGAAGTTCTTGCCGACCACCCTATCGTGTAACTT 
CTCAGCCAGCTCTTCAGGGCTGCTCGGTGCGCGCTTGAGCTTGAGGCCGACCGACCGGTTTGCCTCAGCCTGGCACCAGT 
CCGCGTTCTCAAGACCAAAGGCCTCCTCAAGTGTGCGGCCGCAGAGCTGTGATGTCGTCGCACCCACCGTCACCGAAACC

TTGGTCTGGTAA 

>ORF13779 (SEQ ID NO:216) 

CGGTGGGTGCGACGACATCACAGCTCTGCGGCCGCACACTTGAGGAGGCCTTTGGTCTTGAGAACGCGGACTGGTGCCAG 
GCTGAGGCAAACCGGTCGGTCGGCCTCAAGCTCAAGCGCGCACCGAGCAGCCCTGAAGAGCTGGCTGAGAAGTTACACGA 
TAGGGTGGTCGGCAAGAACTTCGACAAGACCCGCTTTGCGCTGGAGGTACTCGCAAGCGGGCCGCTCAATGGCTGGAAGG 
TTCCCGCGTACATCGCCGAGGGCTTGGCCTGGCTCGAAGCCAAAGTGGCCCACGAGCTTGAGGCGGATGCTGCCATCGCC 
ACCGAGGTCGCGACTATTGAGCCGACTACAGCCGATGTTGTCGCTATCATTGTTGACCCGGGGCAGACGGCATGAGCAGA 

CGAATTGA 

>ORF14293c (SEQ ID NO:218)

GGGAGGTGGTCTTGCCGGAACCTGCGCCAGCACGAACCACGAAGGGCTGCGGAGGCGTCGCTACAATGCATGCGTGGATC
TCGCGGTCGGCGTCGGTATCTGGGCTATCAATTCGTCTGCTCATGCCGTCTGCCCCGGGTCAACAATGATAGCGACAACA
TCGGCTGTAGTCGGCTCAATAGTCGCGACCTCGGTGGCGATGGCAGCATCCGCCTCAAGCTCGTGGGCCACTTTGGCTTC
GAGCCAGGCCAAGCCCTCGGCGATGTACGCGGGAACCTTCCAGCCATTGAGCGGCCCGCTTGCGAGTACCTCCAGCGCAA 
AGCGGGTCTTGTCGAAGTTCTTGCCGACCACCCTATCGTGTAA 

>ORF14155 (SEQ ID NO:220)

CCCGGGGCAGACGGCATGAGCAGACGAATTGATAGCCCAGATACCGACGCCGACCGCGAGATCCACGCATGCATTGTAGC 
GACGCCTCCGCAGCCCTTCGTGGTTCGTGCTGGCGCAGGTTCCGGCAAGACCACCTCCCTCATCAAGGCGCTGGACTGGG
TGATCTCGGAGCACGGCGCCAGCATGCGGGCGAGGAAGCAGATAGTCGCGTGCATCACGTATACCGACCTTGCCACCAAT 
GAAATCCTGGCGGACGTCAACGATGACCCGCTGGTTCATGTCTCGACCATCCACAGCTTTTACTGGTCTATTGCAAAGAC

GTTCCAGGCCGACATCAAGGTTTGGCTGCAGAACGACATCCGCAGGCGGATCTCCGAACTTGAAGAAGAGTTCGAGAATT 

ACAGCTCGCGTGTCCGGCAGACCACGCGCGACAGGAACAAGGCCGACCAAGAGCGATATGTCCGAAGCCTGGAGGCTGTG 

GCCGGCGTCAGGACGTTCAACTACGGCGTGGGCAGTGACTACGCCAAGGGCATACTTGGCCACGAGGACATCCTTCAGCT 

CGCCGACTTCCTGCTACAAAACCGCCCGCTGTTCCGACGGGTCGTGGCGCTGAGCTACCCGTTCGTGTTTATCGATGAGA 

GTCAGGACACGTTCCCGGGTGTAGTGAAGTCTTTCAAGGAAGTGGAAGCCCAGATGCAGGGCAAGTTCTGCCTTGGTTTT

TTCGGCGACCCGATGCAGTCGATCTTCATGAGAGGCGCAGGGGACATCCAGCTTGAGGATCATTGGCGGGCCATCACGAA 

GCCGGAGAACTTTCGCTGCGCCAAGCAGATCCTTGACGTCGCCAATGCCGTGCGCGCGCAGGGCGATGGCATGGAGCAAG 

TCCGCGGGCTGCACGAGAGGGTCGATGGGAACCTCAAGCTGGTGGAGGGGTCGGCCCGGATGTTCGTCTTGCCGAACACG 

CTGAACCGAACCGAGGCTTTGGCAAGAGTCCGAGCGTGGAGCTCGGCGACGAACAACGACGAGGGTTGGACAACCCCAGA 

CATCGCAGTCAAGATTCTTGTCATCGTGCACCGCATGGCCGCAAACCGGCTTGGCTTCGGCGGCATCTACTCGGCGCTGA

ACGACAAGACGTCGGATGCCATGAAGCAAGGGATGCAGGACGGCACCGGTTGGCCCGTTCGACCCTTCCTAAGTTTTGCG 
CTACCGATCGTTGCAGCTGTGAAGGCCGGCAATGAGTTCGCGGCGATGAGCCTGCTCCGGGAATTCAGCCCGCGCCTGGC 
GCCTGCGGCTCTGACCGGCCGACGTGCCGCGGATGTATTGCGAGAGCTGCACGCTGCTGCGTCGAGGCTTGTCGCCATGC 
TGGACGAGGCAGGGACCACCATTGGTGACATAGCTCTCCATCTCTGTGACACGGGTCTTTTTGAGTTCGACGAGCGCTAT 
GCGCGTGTTCTTGGGTTTGTCAGGGATATTGCTGACACCGCTCAGGAGCCCGAGGCTGCTGATGCAGTTCCGGCCGAAGG 
ATTATCCTTGGACGCGACAATGGCCAAGTTCTTCAATTGCTCTGCGCAAGAGCTTTGGCCCTATGAACGCTATGTCTCAG 
AAGGCTCCCCCTATGCCACGCAGCACGGCGTGAAGGGAGCGCAGTTCGAACGCGTCATGGTGGTGATGGACGAGGAAGAA 
AGCGACTACCGAACGTACAACTACGAGCGTGTCTTCGCGAGTGCTGAGGCCCGCGCTGCAGATCGTGCACGAGCACTAGA 
CGGTGATGAAAACACTTGGAGCCGAACGCTGCGACTGCTTTACGTCTGCTGCACTCGTGCCCAGCGGGGGCTGGTACTAG 
CGTTCTTTGTCGCCGACCCTGCGACCACCCTGGAAAACGTCGTGGCGAGCGGGATCTTGCCGCGAAGCGCAGTCTTTACG 
CAGGAAGTGTTAGTTGGATGGCCATAG 

>ORF14360 (SEQ ID NO:222) 

TCGCGTGCATCACGTATACCGACCTTGCCACCAATGAAATCCTGGCGGACGTCAACGATGACCCGCTGGTTCATGTCTCG 
ACCATCCACAGCTTTTACTGGTCTATTGCAAAGACGTTCCAGGCCGACATCAAGGTTTGGCTGCAGAACGACATCCGCAG 
GCGGATCTCCGAACTTGAAGAAGAGTTCGAGAATTACAGCTCGCGTGTCCGGCAGACCACGCGCGACAGGAACAAGGCCG 
ACCAAGAGCGATATGTCCGAAGCCTGGAGGCTGTGGCCGGCGTCAGGACGTTCAACTACGGCGTGGGCAGTGACTACGCC 
AAGGGCATACTTGGCCACGAGGACATCCTTCAGCTCGCCGACTTCCTGCTACAAAACCGCCCGCTGTTCCGACGGGTCGT 
GGCGCTGA 

>ORF15342c (SEQ ID NO:224) 

GAAGGGTCGAACGGGCCAACCGGTGCCGTCCTGCATCCCTTGCTTCATGGCATCCGACGTCTTGTCGTTCAGCGCCGAGT 
AGATGCCGCCGAAGCCAAGCCGGTTTGCGGCCATGCGGTGCACGATGACAAGAATCTTGACTGCGATGTCTGGGGTTGTC 
CAACCCTCGTCGTTGTTCGTCGCCGAGCTCCACGCTCGGACTCTTGCCAAAGCCTCGGTTCGGTTCAGCGTGTTCGGCAA 
GACGAACATCCGGGCCGACCCCTCCACCAGCTTGAGGTTCCCATCGACCCTCTCGTGCAGCCCGCGGACTTGCTCCATGC 
CATCGCCCTGCGCGCGCACGGCATTGGCGACGTCAAGGATCTGCTTGGCGCAGCGAAAGTTCTCCGGCTTCGTGATGGCC 
CGCCAATGATCCTCAAGCTGGATGTCCCCTGCGCCTCTCATGAAGATCGACTGCATCGGGTCGCCGAAAAAACCAAGGCA 
GAACTTGCCCTGCATCTGGGCTTCCACTTCCTTGAAAGACTTCACTACACCCGGGAACGTGTCCTGACTCTCATCGATAA 
ACACGAACGGGTAGCTCAGCGCCACGACCCGTCGGAACAGCGGGCGGTTTTGTAG 

>ORF15260c (SEQ ID NO:226)

ATGCCGCCGAAGCCAAGCCGGTTTGCGGCCATGCGGTGCACGATGACAAGAATCTTGACTGCGATGTCTGGGGTTGTCCA

ACCCTCGTCGTTGTTCGTCGCCGAGCTCCACGCTCGGACTCTTGCCAAAGCCTCGGTTCGGTTCAGCGTGTTCGGCAAGA 

CGAACATCCGGGCCGACCCCTCCACCAGCTTGAGGTTCCCATCGACCCTCTCGTGCAGCCCGCGGACTTGCTCCATGCCA 

TCGCCCTGCGCGCGCACGGCATTGGCGACGTCAAGGATCTGCTTGGCGCAGCGAAAGTTCTCCGGCTTCGTGATGGCCCG

CCAATGA 

>ORF14991 (SEQ ID NO:228) 

CGTCGCCAATGCCGTGCGCGCGCAGGGCGATGGCATGGAGCAAGTCCGCGGGCTGCACGAGAGGGTCGATGGGAACCTCA 
AGCTGGTGGAGGGGTCGGCCCGGATGTTCGTCTTGCCGAACACGCTGAACCGAACCGAGGCTTTGGCAAGAGTCCGAGCG 
TGGAGCTCGGCGACGAACAACGACGAGGGTTGGACAACCCCAGACATCGCAGTCAAGATTCTTGTCATCGTGCACCGCAT 
GGCCGCAAACCGGCTTGGCTTCGGCGGCATCTACTCGGCGCTGAACGACAAGACGTCGGATGCCATGAAGCAAGGGATGC 
AGGACGGCACCGGTTGGCCCGTTCGACCCTTCCTAAGTTTTGCGCTACCGATCGTTGCAGCTGTGAAGGCCGGCAATGA 

>ORF15590c (SEQ ID NO:230) 

CGCTCGTCGAACTCAAAAAGACCCGTGTCACAGAGATGGAGAGCTATGTCACCAATGGTGGTCCCTGCCTCGTCCAGCAT 
GGCGACAAGCCTCGACGCAGCAGCGTGCAGCTCTCGCAATACATCCGCGGCACGTCGGCCGGTCAGAGCCGCAGGCGCCA 
GGCGCGGGCTGAATTCCCGGAGCAGGCTCATCGCCGCGAACTCATTGCCGGCCTTCACAGCTGCAACGATCGGTAGCGCA
AAACTTAGGAAGGGTCGAACGGGCCAACCGGTGCCGTCCTGCATCCCTTGCTTCATGGCATCCGACGTCTTGTCGTTCAG 
CGCCGAGTAG 

>ORF15675c (SEQ ID NO:232)

TCCTTCGGCCGGAACTGCATCAGCAGCCTCGGGCTCCTGAGCGGTGTCAGCAATATCCCTGACAAACCCAAGAACACGCG 
CATAGCGCTCGTCGAACTCAAAAAGACCCGTGTCACAGAGATGGAGAGCTATGTCACCAATGGTGGTCCCTGCCTCGTCC 
AGCATGGCGACAAGCCTCGACGCAGCAGCGTGCAGCTCTCGCAATACATCCGCGGCACGTCGGCCGGTCAGAGCCGCAGG 
CGCCAGGCGCGGGCTGAATTCCCGGAGCAGGCTCATCGCCGCGAACTCATTGCCGGCCTTCACAGCTGCAACGATCGGTA 
G 

>ORF16405 (SEQ ID NO:234)

ATCGACTCTTTGAGGAAATGCGTGGGAAGCCTGGAAAAGTGCTGTTTCGCCTGCAAAGAAATAATTCATGTTCATGCGAT 
TCGTTGTCGGCAGTGCGGCGAGTCCCAAGGCTGGCGAAGGTTCATGAGCTCTCCAACCTCAGTAGTTGCGTTGGTCCTTA 
GCCTTTTATCAATCGCTGCCACAAAACCTGTGGAGCGATTGTTCGATGCCCAGCGAGCAGAGCTACAAATCTCG^TCACG 
GGTGGTGATTAGU&GCTGCCCAGCTTATGTTGACC^ 

ATCGAAAGCCACGACCAATACGAAAACATGGTTTTTGGTAAGCAATACGGATGGCGAAATTCTGGAGCCAGGCAAAACTT 
ACAAAATCAGGGCCTCAACCGATGAGTCTATCCCAAAAATTGTCGAAGCTGAGCGTCGGACGATTTTGAAGTCTCAGTAC 
GCACTTGCAGATAATTGCGAATTAACCGCTAAATACATAGAGGCCACGGGGCAGAAGGTTGTGCGTGTGCAACCGTTCAT 
GTGCGACACACCTCCTGAAAAGGGTGGCCTGCCCCCTGGTAAACCTGGCATACCCATTTGGTACCTTGGTCAAGAATGA 

>ORF16925 (SEQ ID NO:236) 

AGGCCACGGGGCAGAAGGTTGTGCGTGTGCAACCGTTCATGTGCGACACACCTCCTGAAAAGGGTGGCCTGCCCCCTGGT 
AAACCTGGCATACCCATTTGGTACCTTGGTCAAGAATGATGTTTTTATGCCGCCCTGGGCTTTGACGCCGATTAAGCAAA 
GCTGTGTTCGCTCATCCAATACGTCCCTCGCCCAGTTAAACGACTGTTATGTATATGGGTGCTGCCGCTACGTAATACCT 
TGGCCCTACGCATACGAAGTTAATTCTGAAAGCGTTCAATGGACAATCTTCCTCCTCGGCGTCGACTGCAGCGGTAAGGT 
GATCTACTTTCGAAACACTGCAAGGGTAGGTCCTTTTTTGGCAGCGTCCATATACCGACCGTGGTATGGCTCAGATGCGC 
TGGTACTGCATTTCACCAAATAA 

>ORF17793c (SEQ ID NO:238)

GCCAAAATGATTGTCATTGACAAAAATCTAGAACATCTTGTTGCGCAATGCGCTATATGTGAAAAAACTTTATTTGACGA
GTTTTCTCTCAAGATTCAATTGGGGCATACATATTACGAGCCAAAATCTTTGCCCGCCTCTGCAAGCATTGTATATGGGT 
CGCATCCAGCCCCGTCGACGTTTTTTTTGGAACCAAAAGAAATTCAGCAAAATTTGGTGCTGAAATCCGGTGAGCAAGTC 
ATCACCTGCAGTAAACATCGATACAAAATACCGTTAGATTATTTTGGTCTGGTGCAAACCAAAGGAACCCTTGCGCGATT 
GTTCGTGCAGGTAACCTGTAATGACGGTCAGGTAGAGCCGGGGTTCGACGGGTACGTAACCCTTGAAATCGTCAATATGT 
CGCCTTGGACGATAGAAATACCGGCCGTGAGCGATATAGCACAACTTTATTTGGTGAAATGCAGTACCAGCGCATCTGAG 
CCATACCACGGTCGGTATATGGACGCTGCCAAAAAAGGACCTACCCTTGCAGTGTTTCGAAAGTAG 

>ORF18548c (SEQ ID NO:240) 

AGGACAATGGCAGGGTGGCCGCGTCTCGCAGCCC^GGACGAAGGACAAATCTGATGAGTGTGTTACAGATC^AAGGGCG 
TACAACGAAATCCCACACGGATTTTGACGCGGCATCGTACTCCAGCAACAGCCTTATACTCACTGATGCAGGGGACGAGA 
GAATTGAAGAGTTTTCCCTCGAATTGTCCGTGGGTGAAGGGTGGAGTGATAACTATTCTGGCAACGACAAAAACCTGTGG 
CGCATTGTCGATGGTATGACGATCAGGGGTCACGATTCTGTTGTGGTGGAGGCCGCTGAAGAAATCAAGGTGCCGCACAA 
TCGGTACGGCATAGTCCTACCTACGGGAAGTCTTTTTCTCTCACGCGGCGTGCTGGTTGCTTCGGCGAAGGTCGAACCTG 
CATTTGATGGCAAGCTCAAGCTCAGGATATTCAACACCACCAACAAAAATGTCTGCCTTACCAAAGGCGAGAAGCTTGGC
TCTGTGATTTTTTTCTCCACAGAATCGACGCACACCCAAAGCCCCATCAAGCGTGGCAGTGAAATATCGACGCTTCCCAT 
CACGCGGCGCGCGCGATTGAAGAAGTGGTTTTCGCTCAATCCCACCATATGGGTCGGGTGGACGCTGAATTTAATCGGAA 
GTTCCCTGGTGTCTTCTCTTATAATGTACGCCGTCTATTACAAGGTTGTGCTGGAACACCAGTCGCAGCCTCCTCAGTCA 
CAACAAAACGCTCAGCCATCGCCGAACGAAGTTAAGCCAAAATGA 

>ORF17875 (SEQ ID NO:242) 

ACGGCGTACATTATAAGAGAAGACACCAGGGAACTTCCGATTAAATTCAGCGTCCACCCGACCCATATGGTGGGATTGAG 
CGAAAACCACTTCTTCAATCGCGCGCGCCGCGTGATGGGAAGCGTCGATATTTCACTGCCACGCTTGATGGGGCTTTGGG 
TGTGCGTCGATTCTGTGGAGAAAAAAATCACAGAGCCAAGCTTCTCGCCTTTGGTAAGGCAGACATTTTTGTTGGTGGTG 
TTGAATATCCTGAGCTTGAGCTTGCCATCAAATGCAGGTTCGACCTTCGCCGAAGCAACCAGCACGCCGCGTGAGAGAAA 
AAGACTTCCCGTAGGTAGGACTATGCCGTACCGATTGTGCGGCACCTTGATTTCTTCAGCGGCCTCCACCACAACAGAAT 

CGTGA 

>ORF18479 (SEQ ID NO:244) 

TCTGTAACACACTCATCAGATTTGTCCTTCGTCCTTGGGCTGCGAGACGCGGCCACCCTGCCATTGTCCTTTATACCGGC 
CGATATCCCCGGATACCGCCTGAAAGATGACGTGCGCAAAGCGTGCACCAATCTGAATTTCAAACGCCTCGCTGTGATTG 
TTGGTGAGCGCGAACGTCATCGGCCCTACATAACCTGGAGGCAGCACACTGGAACTGAACGTTATCCCGCTTCTGAACAG 
CGTGCTTCTCGGAAAAAAAAGCGCCGCCAGATCTTCCGGCAGATCGAATTCTTCCATGGTGCTCGCCAGATAAGTCTTGC 
CCGGTTCCATGACGAAGCAGTCATCCGGGTCTGCGAGCACGACCTCGCTGGCAGGGGTGCGTCGCGTAGATTCTCGCAAG 
CTTCCACCCCCTACTGTCAGGCGAGAGAGGCCTGCGAGTCTGAGGTCAAATCCAACGCCTTCCGGGGTGGTCAACTCACG 
GTGGGCAAGGTGCTTGATTAG 

>ORF19027c (SEQ ID NO:246) 

ATGATTTACTCACCGCACTCGCTCCTGAAACTGGTCCGGGATGGAAAACTAATCAAGCACCTTGCCCACCGTGAGTTGAC 
CACCCCGGAAGGCGTTGGATTTGACCTCAGACTCGCAGGCCTCTCTCGCCTGACAGTAGGGGGTGGAAGCTTGCGAGAAT 
CTACGCGACGCACCCCTGCCAGCGAGGTCGTGCTCGCAGACCCGGATGACTGCTTCGTCATGGAACCGGGCAAGACTTAT 
CTGGCGAGCACCATGGAAGAATTCGATCTGCCGGAAGATCTGGCGGCGCTTTTTTTTCCGAGAAGCACGCTGTTCAGAAG 
CGGGATAACGTTCAGTTCCAGTGTGCTGCCTCCAGGTTATGTAGGGCCGATGACGTTCGCGCTCACCAACAATCACAGCG 
AGGCGTTTGAAATTCAGATTGGTGCACGCTTTGCGCACGTCATCTTTCAGGCGGTATCCGGGGATATCGGCCGGTATAAA 
GGACAATGGCAGGGTGGCCGCGTCTCGCAGCCCAAGGACGAAGGACAAATCTGA 

>ORF19305 (SEQ ID NO:248) 

TGGCCGTTCTCTGCCTGTCGCCTCTTTGGCATGACTGGTCAAGTCGGATGCAAACGGTGGTCAGCACCAATGCAATTGGG 
TGGTCATGTGCGATGCAATTACGCAGTTGAGCCTGGCCCAGTTCCTCCCAAGCAAAGCATAAGACCAAGATGGCACATTG 
CCAACAAAATACCCTTCCCCGCTACCGTTGTTTTATCGTTGTTGCCAGCCCTGATCTGGCGGAAAAGCCCGCTCCATGAA 
TCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCCCCCACCCCAACAACCAAAGCTGCC 

CCAGGGGGATTCATCCTTCCTCTGA 

>ORF19519 (SEQ ID NO:250)

TCTGGCGGAAAAGCCCGCTCCATGAATCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGT
CCCCCACCCCAACAACCAAAGCTGCCCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGC
CGCCGGCAGCTACTGGAGAACATCTGGCAGCGCG^ 

GGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATC

ACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAG

TCAGCCCAGGCTGA 

>ORF19544 (SEQ ID NO:252) 
ATCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCCCCCACCCCAACAACCAAAGCTGC

CCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCT 
GGCAGCGCGCCTCGCTATCCAAGCAGCAATTCGA^ 

CAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGC 

ACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGG 

CCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGC 

ACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGG 

CGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGG 

CTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCC

TCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGA

CGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACG
CACTCTGGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCC 
TCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGC 
CACGGTAGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTG 
CCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGT 
GAAATTCCG^ACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCICCATCAACCTGCGCCGAGCGTTGC 
GAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCC1CAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTC 
TTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGG 
GAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCC

TAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACG 

ACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAG

CATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGG 

GCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGG 

CCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCC

GAAGGAGGTGTGGAATGA 

>ORF20008 (SEQ ID NO:254) 

GCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCG 
AATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGC

TTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGT 

GAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGC

AGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAG 

>ORF20623c (SEQ ID NO:256) 

CGTGAACTTGTTTCTCCATCCAGCACCGTTGTCTACCGTGGCCGTCCAAATGGCCTTGTCCTCGGCATTTGTCTGGATGA 
CGGCTTGGTCCTGGAGCATGCTGAAGAACGGCGCGTTAGAGGAGGGCACCCCATCGATACCCTGGGCCAGCAGGTAGGCT 
CTCAGTTGATCGGCAGCAGGCTTGCTCACCAGCCAGAGTGCGTCCTGGGTCAGCCATCCATCAGACGGGCCGCTAGGTTG 
ATTCAACTTGAACTTGTCCTTCACCAAGAAGCGAAGGCCGTCTGCCAACTGCCGCTGCAGCGACTGCTTCGGTGCAGCCA 
GAGCTCGATCCGGATTGCCTCCTAG 

>ORF21210c (SEQ ID NO:258)

CGCTTGAAAATTCCTGGCGTGACCAGCATGGCGGTCCCGTCTACGGTATGCACCAAAGCCTTGGTGTCGTTGATGAACAG
GCGACGGGCCGCGATGCCAGATTTCATCCAACCAACGAATCCCTGTCCCAGATCAGTACTTCTGCTAGGCATAAATACAT 
CTTCAATTGCTTCAGGAGCGCAATCTGTTGGCTCCTTGGTCCCTAGTGGCTGCTGTAGGTTCTCCTCCCCGCGTGTGTTC 
GTAGGAGAGGCAGCCGGCGAGTCGTGGCTAGTGTCTAGCTCTTCTAGTGGCGAATTGATATTACCAAGAAGTGCATACAA 
ATCGTCTGTTTCTTCTTGA 

>ORF21493c (SEQ ID NO:260) 

GCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGACCGTGAGGCTTGGGTTGTCCAGAGGCTGCTCAGGGA 

ACAGCAATTTGGGATCCTGGAGCAGGTAGGCCTTGAGCTCTTTCGTCTTGCGAGGACCAGAAACCTTGATGGTCCAGATG

TTCAGGTTTTTACTGGTCTTCCGATGAAGCCCCTGTTTTTCGAACGCGCGCTGCACCAGCTTCCAGCCGGTCGTCTCCTT

GGCTTGGGCCAGTTTTTCAAGCACCGGATGCTCTTGGACATAGCGCTTGAAAATTCCTGGCGTGACCAGCATGGCGGTCC
CGTCTACGGTATGCACCAAAGCCTTGGTGTCGTTGATGAACAGGCGACGGGCCGCGATGCCAGATTTCATCCAACCAACG 
AATCCCTGTCCCAGATCAGTACTTCTGCTAGGCATAAATACATCTTCAATTGCTTCAGGAGCGCAATCTGTTGGCTCCTT 
GGTCCCTAG 

>ORF21333 (SEQ ID NO:262) 
ACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTG

TTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAG
CTCACCGAGGAGTACATCTTCGCGCACGATCTCCGGGAAGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCT
CAAGCACTTCGGTCCTACGGCAACCGTACAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACGCAAGGTACTGGAAC 
AAGGCCTGTCCAAGCGGAGCTGGAACACGTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGA 

>ORF22074c (SEQ ID NO:264)

GTGAGGCACAAGCCCCTCCGTTATTGGCACTACGAACTCTTTGTGAGTCTTCTCTGTCTCGCCGCGGATGAGGATCAGTT
GATTTTCCCAGTCGATGTCGCGCTTGCGGATGCACAACAGCGCATTCAACCGGATGCCGGTGAAGTAGAAGACCTCAAAC 
GTGCAAAGCCAGAACCAGGCGGGCGTGATCCGTGCGCGTTCGCCGGTGCAGCGCTCTGCGCCGTCCTGCATGTTGAGCCA 
ATTGCGGGCGAGCAGGATGGCTTCGGCGGCGACGGTTTTGCTTGCTCGCCTGGGGGGAATGACGGTGGTCTTTCTGAACG 
GGTTGACTTGGGAGTGCGTCACCAACTCATGCTCGATGGCATAGCCCCAGATCGTCCGCAGATGATTCGAGTACGTGTTC 
CAGCTCCGCTTGGACAGGCCTTGTTCCAGTACCTTGCGTCGCCATCCCAGGACAGCCCGGTGGTCCACGTCCTGTACGGT 
TGCCGTAGGACCGAAGTGCTTGAGCAGCGCCTTGGTCGCGGCGCGGTAGATCTTCGCGCTGGCTTCCCGGAGATCGTGCG
CGAAGATGTACTCCTCGGTGAGCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGACCGTGAGGCTTGGGT 
TGTCCAGAGGCTGCTCAGGGAACAGCAATTTGGGATCCTGGAGCAGGTAG 

>ORF21421 (SEQ ID NO:266) 

GCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAGCTCACCGA 
GGAGTACATCTTCGCGCACGATCTCCGGGAAGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCTCAAGCACT 
TCGGTCCTACGGCAACCGTACAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACGCAAGGTACTGGAACAAGGCCTG 
TCCAAGCGGAGCTGGAACACGTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGAGTTGGTGACGCA 
CTCCCAAGTCAACCCGTTCAGAAAGACCACCGTCATTCCCCCCAGGCGAGCAAGCAAAACCGTCGCCGCCGAAGCCATCC 
TGCTCGCCCGCAATTGGCTCAACATGCAGGACGGCGCAGAGCGCTGCACCGGCGAACGCGCACGGATCACGCCCGCCTGG 
TTCTGGCTTTGCACGTTTGAGGTCTTCTACTTCACCGGCATCCGGTTGAATGCGCTGTTGTGCATCCGCAAGCGCGACAT 
CGACTGGGAAAATCAACTGATCCTCATCCGCGGCGAGACAGAGAAGACTCACAAAGAGTTCGTAGTGCCAATAACGGAGG 
GGCTTGTGCCTCACCTATCGAGGCTCCTGCAAGAGGCCGATAGAGCCGGATTCGCCGATGACGACCAGTTGTTCAACGTC 
AACCGGTTCTCACCGCACTACAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCCATGTACCGGAAGTTGACCGAGAA
GGTTGGGGTGCGGATGACCCCGCACCGTTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTC 
ACCTCACGAAGTGCCTGCTCAACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTACGATCACATGCGT 
GCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGTCAGGAAGGTGGATTACAGCGGCTCCCCGCAAGC 
CTCTGCCAAACCGAAGCCATGCGGGCAACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAGGCCAGGACAGAGCCTG 
CAGAACCAAGGGAGCACACGCCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGAAGCAGATGCGCTACCACAGCCA 
CCTGACACCTTCGAACCAAGCGTGCTGTTCACTCTGATGGCTCAAAACTTATCGAACCGTGCCGCCTCGGCATCCGCGGC 
TCCCGCTGCAACAAGCGGATCAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAG 

>ORF22608c (SEQ ID NO:268) 

CGCATCTGCTTCCCACGCGGTTGGACCTCCCTGAATGCCTGTCCCTGGCGTGTGCTCCCTTGGTTCTGCAGGCTCTGTCC 
TGGCCTCCGGTGGCGGCGCTTCACTCACTCGAGCGAGAGGTTGCCCGCATGGCTTCGGTTTGGCAGAGGCTTGCGGGGAG 
CCGCTGTAATCCACCTTCCTGACATTCTCCAGCGCGCCTTGGGCCAGGCTTCTAGCATGCAGCACGGCACGCATGTGATC 
GTAGTCGGCCTCGATGTAGCTCATCGTGGTCTGGATATTCGAGTGGTTGAGCAGGCACTTCGTGAGGTGAATGTTCCGCT 
CGGGTGCCTTCATCAAGTCGGTGGCCAGGGTGTGCCGGAAACGGTGCGGGGTCATCCGCACCCCAACCTTCTCGGTCAAC 
TTCCGGTACATGGCTTCGACCTGGTCGGAGTTCATCACCTTGCTCTTGTAGTGCGGTGA 

>ORF22626 (SEQ ID NO:270) 

CACCTTCGAACCAAGCGTGCTGTTCACTCTGATGGCTCAAAACTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCG 
CTGCAACAAGCGGATCAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAGCGATACCGGTACTGAGGGCCGGCT 
ACCGGACGAAAGGTAGCCGTGCCTTCCAGCAGATCGTTAGGCCTGTAGGAAAAATCTGGAATTACCGAGAGCGCCTGGAT 
TCCAGCGCCGGCATGCTGGCAGAGCCAGCGCAATTTCAAGGCCAATACCACAGTACCCTCTGTAATCGCTGA 
>ORF23228 (SEQ ID NO:272)

AGAGATTCGAACTCCCGACATCCTGCTCCCAAAGCAGGCGCGCTACCGGACTGCGCTATACCCCGATTGGAATTTGGCTC
CGCGACCTGGACTCGAACCAGGGACCCAATGATTAACAGTCATTTGCTCTACCGACTGAGCTATCGCGGAACGTCTTTCT 
TCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTACGGATGCGCGCGGGCATGTCAA 
CCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCTTCCACTGCATGCGGCAGCGCTCTCGCG 
CCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATC 
TCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCCAGGATCAGTTGCGCCAGCGGGTTCTC 
GATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCG 
CCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCG 
GCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGTCGATCCGGTTGATGAATTCCGGACGGAAGTGCGCATT 
GACCGCGTCCATCACTGCGGCACGTTGCGCCTCGCGGTCGCCGGCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGG 
TCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGTGACTGTCGGTCAGGCGTCCGTCCTCGAGCACCTGGAGG 
AGAATGTTGAATACATCCGGATGGGCCTTCTCCACCTCGTCCAGCAGCACCACCGAGTAGGGCTTGCGGCGGATCGCCTC 
GGTCAGGTAGCCGCCTTCCTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCCATGA 
ACTCGGACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTG 
CCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCAC 
GGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCT 
CGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCACC 
TTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTG 
GTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCT 
GCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGC 
TTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTCTTCCTTCTTCAGCGCCTCGCGCTCGAT 
CTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGGCTTGGAGTCGATCTCCATGCGGATGCGGCTGGCGG 
CCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTAGCGGTGCGACAGCTTGGCCGCGGCGATG 
ATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTTGAGGCCACGGAGGATGGCGATGGTGTCTTC 
CTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGCGCGGCATCCTTCTCGATGTACTGGCGATACTCGT 
CGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGCCGGCTTGAGCATGTTGCCGGCGTCCATGGCACCT 
TCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTCGATGAACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTT 
GAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTTGGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCA 
GGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCG 
ACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGCCGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTC 
GCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTTGGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCG 
ACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACG 
CCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTG 
GTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGTGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCG 
GGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGGGCGGCGATATCGAAGCCGACCTGCATC 
AGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCATG 
GCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCAAACGGTCTATTCGCATGGGTCGTCCTT 

CCTTCTATAG 

>ORF23367 (SEQ ID NO:274) 

GCTATCGCGGAACGTCTTTCTTCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTAC 
GGATGCGCGCGGGCATGTCAACCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCTTCCACT 
GCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGCCCCCGA 
AGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCCAGGAT 
CAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAG 
>ORF25103c (SEQ ID NO:276)

AGTGCACCACGGGGTGAGCATCACCGACGGCGCGATCATCGCCGCGGCCAAGCTGTCGCACCGCTACATCACCGATCGGC
AACTGCCGGACAAGGCCATCGACCTGATCGACGAGGCCGCCAGCCGCATCCGCATGGAGATCGACTCCAAGCCGGAGGAA 
CTGGATCGTCTCGACCGTCGCCTGATCCAGCTGAAGATCGAGCGCGAGGCGCTGAAGAAGGAAGACGACGAAGCCACCAG 
GAAGCGCCTGGCCAAGCTGGAGGAGGATATCGTCAAGCTCGAGCGCGAATACGCCGACCTCGAGGAGATCTGGAAGTCCG 
AGAAGGCCGAGGTGCAGGGCTCGGCGCAGATCCAGCAGAAGATCGAGCAGGCCAAGCAGGAGATGGAGGCGGCGCGGCGC 
AAGGGCGACCTCGAGAGCATGGCGCGCATCCAGTACCAGACCATCCCGGACCTGGAACGCAGCCTGCAGATGGTCGACCA 
GCACGGCAAGACCGAGAACCAGTTGCTGCGCAACAAGGTGACCGACGAGGAAATCGCCGAAGTGGTTTCCAAGTGGACCG 
GTATCCCGGTGTCGAAGATGCTCGAGGGCGAGCGCGAGAAGCTGCTGCGCATGGAGCAGGAGCTGCATCGGCGAGTGATC 
GGCCAGGACGAGGCGGTAGTCGCCGTGTCCAACGCCGTGCGCCGTTCGCGCGCCGGCCTCGCCGATCCGAACCGGCCGAG 
CGGCTCGTTCCTCTTCCTCGGCCCGACCGGGGTGGGCAAGACCGAGTTGTGCAAGGCGCTGGCCGAGTTCCTCTTCGATA 
CCGAGGAGGCGCTGGTGCGGATAGATATGTCCGAGTTCATGGAGAAACACTCGGTGGCCCGCCTGATCGGCGCGCCTCCG 
GGCTACGTCGGCTTCGAGGAAGGCGGCTACCTGACCGAGGCGATCCGCCGCAAGCCCTACTCGGTGGTGCTGCTGGACGA 
GGTGGAGAAGGCCCATCCGGATGTATTCAACATTCTCCTCCAGGTGCTCGAGGACGGACGCCTGACCGACAGTCACGGGC 
GTACGGTGGACTTCCGCAACACCGTGGTGGTGATGACCTCCAACCTCGGTTCGGCGCAGATCCAGGAGCTGGCCGGCGAC 
CGCGAGGCGCAACGTGCCGCAGTGATGGACGCGGTCAATGCGCACTTCCGTCCGGAATTCATCAACCGGATCGACGAAGT 
GGTGGTGTTCGAGCCGCTGGCTCGCGAGCAGATCGCCGGCATCGCCGAGATCCAGCTCGGTCGCCTGCGCAAGCGCCTGG 
CCGAGCGCGAGCTGAGCCTGGAACTGAGCCAGGAGGCGCTGGACAAGCTGATTGCCGTCGGCTTCGACCCGGTCTATGGC 
GCACGCCCGCTGAAGCGGGCCATCCAGCGCTGGATCGAGAACCCGCTGGCGCAACTGATCCTGGCCGGCAAATTCGCGCC 
GGGTGCCAGTATCTCGGCGAAGGTGGAAGGCGACGAGATCGTCTTCGCCTGACCTCTTCGGGGGCGGACAGGGAAAGCCC 
CGCTTCGGCGGGGCTTTTTCATGGGCGTCCGGTAGGCGCGAGAGCGCTGCCGCATGCAGTGGAAGGGCCGTTTTGTCGCT 
CGTGGAAAAAGAAGAAAAACTTTTTGGATCAGAGGGTTGACATGCCCGCGCGCATCCGTAAAATGGCGCGCCTCTGA 

>ORF23556 (SEQ ID NO:278) 

AAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTC 
GCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCCAGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGC 
CCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCA 
GGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCC 
AGCGGCTCGAACACCACCACTTCGTCGATCCGGTTGA 
>ORF26191c (SEQ ID NO:280)

AAGGAAGGACGACCCATGCGAATAGACCGTTTGACCAGCAAGCTGCAACTGGCGCTCTCCGACGCCCAGTCCCTGGCCGT
TGGCCATGACCATCCGGCCATCGAGCCGGTGCACCTGCTTTCCGCCCTGCTCGAGCAGCAAGGCGGTTCGATCAAGCCCC 
TGCTGATGCAGGTCGGCTTCGATATCGCCGCCCTGCGCAGCGGCCTCAACAAAGAACTCGACGCGCTGCCGAAGATCCAG 
AGCCCGACCGGCGACGTGAACCTGTCCCAGGATCTCGCACGCCTGCTCAACCAGGCTGACCGCCTGGCCCAGCAGAAGGG 
CGACCAGTTCATCTCCAGCGAGCTGGTATTGCTGGCCGCGATGGACGAGAACACCAGGCTCGGCAAGCTGCTGCTCGGCC
AGGGCGTGTCGCGCAAGGCGCTGGAGAATGCCGTGGCCAACCTGCGTGGCGGCGAAGCGGTGAACGACCCGAACGTCGAG 
GAGTCGCGCCAGGCGCTGGACAAGTACACCGTCGACATGACCAAGCGCGCCGAGGAAGGCAAGCTCGACCCGGTGATCGG 
TCGCGACGACGAGATCCGCCGGACCATCCAGGTCCTGCAGCGGCGGACCAAGAACAACCCGGTGCTGATCGGCGAACCCG 
GCGTCGGCAAGACCGCCATCGTCGAGGGCCTGGCCCAGCGCATCATCAACGGCGAAGTGCCGGACGGCCTCAAGGACAAG 
CGCCTGCTGGCCCTGGACATGGGGGCGCTGATCGCCGGTGCCAAGTTCCGCGGCGAGTTCGAGGAACGCCTGAAGGCGGT 
CCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTCATCCTGTTCATCGACGAACTGCACACCATGGTCGGCGCCGGCAAGG 
CGGAAGGTGCCATGGACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTGCACTGCGTCGGTGCTACTACC 
CTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAAGGTGCTGGTGGACGAACCGAG 
CGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTATGAAGTGCACCACGGGGTGAGCATCACCGACGGCG 
CGATCATCGCCGCGGCCAAGCTGTCGCACCGCTACATCACCGATCGGCAACTGCCGGACAAGGCCATCGACCTGATCGAC
GAGGCCGCCAGCCGCATCCGCATGGAGATCGACTCCAAGCCGGAGGAACTGGATCGTCTCGACCGTCGCCTGATCCAGCT 
GAAGATCGAGCGCGAGGCGCTGAAGAAGGAAGACGACGAAGCCACCAGGAAGCGCCTGGCCAAGCTGGAGGAGGATATCG 
TCAAGCTCGAGCGCGAATACGCCGACCTCGAGGAGATCTGGAAGTCCGAGAAGGCCGAGGTGCAGGGCTCGGCGCAGATC 
CAGCAGAAGATCGAGCAGGCCAAGCAGGAGATGGAGGCGGCGCGGCGCAAGGGCGACCTCGAGAGCATGGCGCGCATCCA
GTACCAGACCATCCCGGACCTGGAACGCAGCCTGCAGATGGTCGACCAGCACGGCAAGACCGAGAACCAGTTGCTGCGCA 
ACAAGGTGACCGACGAGGAAATCGCCGAAGTGGTTTCCAAGTGGACCGGTATCCCGGTGTCGAAGATGCTCGAGGGCGAG 
CGCGAGAAGCTGCTGCGCATGGAGCAGGAGCTGCATCGGCGAGTGATCGGCCAGGACGAGGCGGTAGTCGCCGTGTCCAA 
CGCCGTGCGCCGTTCGCGCGCCGGCCTCGCCGATCCGAACCGGCCGAGCGGCTCGTTCCTCTTCCTCGGCCCGACCGGGG 
TGGGCAAGACCGAGTTGTGCAAGGCGCTGGCCGAGTTCCTCTTCGATACCGAGGAGGCGCTGGTGCGGATAGATATGTCC 
GAGTTCATGGAGAAACACTCGGTGGCCCGCCTGATCGGCGCGCCTCCGGGCTACGTCGGCTTCGAGGAAGGCGGCTACCT 
GACCGAGGCGATCCGCCGCAAGCCCTACTCGGTGGTGCTGCTGGACGAGGTGGAGAAGGCCCATCCGGATGTATTCAACA 
TTCTCCTCCAGGTGCTCGAGGACGGACGCCTGACCGACAGTCACGGGCGTACGGTGGACTTCCGCAACACCGTGGTGGTG 
ATGACCTCCAACCTCGGTTCGGCGCAGATCCAGGAGCTGGCCGGCGACCGCGAGGCGCAACGTGCCGCAGTGATGGACGC 
GGTCAATGCGCACTTCCGTCCGGAATTCATCAACCGGATCGACGAAGTGGTGGTGTTCGAGCCGCTGGCTCGCGAGCAGA 
TCGCCGGCATCGCCGAGATCCAGCTCGGTCGCCTGCGCAAGCGCCTGGCCGAGCGCGAGCTGAGCCTGGAACTGAGCCAG 
GAGGCGCTGGACAAGCTGATTGCCGTCGGCTTCGACCCGGTCTATGGCGCACGCCCGCTGAAGCGGGCCATCCAGCGCTG 
GATCGAGAACCCGCTGGCGCAACTGATCCTGGCCGGCAAATTCGCGCCGGGTGCCAGTATCTCGGCGAAGGTGGAAGGCG 
ACGAGATCGTCTTCGCCTGA 

>ORF23751 (SEQ ID NO:282) 

ACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCG 
CTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGT 
CGATCCGGTTGATGAATTCCGGACGGAAGTGCGCATTGACCGCGTCCATCACTGCGGCACGTTGCGCCTCGCGGTCGCCG 
GCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGTG 
A 

>ORF24222 (SEQ ID NO:284) 

CCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCCATGAACTCGGACATATCTATCCGCACCAGCGCCTCCTC 
GGTATCGAAGAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGC 
CGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCACGGCGTTGGACACGGCGACTACCGCCTCGTCCTGG 
CCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGAT 
ACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCACCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGT 
GCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGTCGCCC 
TTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCTGCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTT 
CTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCT 
TCCTGGTGGCTTCGTCGTCTTCCTTCTTCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCC 
AGTTCCTCCGGCTTGGAGTCGATCTCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAG 
TTGCCGATCGGTGATGTAG 
>ORF24368 (SEQ ID NO:286)

ACTCGGACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTG
CCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCAC

GGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCT 
CGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCACC
TTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTG 
GTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCT

GCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGC 
TTGA 

>ORF24888c (SEQ ID NO:288) 

AGAAGGAAGACGACGAAGCCACCAGGAAGCGCCTGGCCAAGCTGGAGGAGGATATCGTCAAGCTCGAGCGCGAATACGCC 
GACCTCGAGGAGATCTGGAAGTCCGAGAAGGCCGAGGTGCAGGGCTCGGCGCAGATCCAGCAGAAGATCGAGCAGGCCAA 
GCAGGAGATGGAGGCGGCGCGGCGCAAGGGCGACCTCGAGAGCATGGCGCGCATCCAGTACCAGACCATCCCGGACCTGG 
AACGCAGCCTGCAGATGGTCGACCAGCACGGCAAGACCGAGAACCAGTTGCTGCGCAACAAGGTGA 

>ORF25398c (SEQ ID NO:290) 

AGGCGGTCCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTCATCCTGTTCATCGACGAACTGCACACCATGGTCGGCGCC 
GGCAAGGCGGAAGGTGCCATGGACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTGCACTGCGTCGGTGC 
TACTACCCTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAAGGTGCTGGTGGACG 
AACCGAGCGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTATGAAGTGCACCACGGGGTGA 

>ORF25892c (SEQ ID NO:292) 

CCGCCTGGCCCAGCAGAAGGGCGACCAGTTCATCTCCAGCGAGCTGGTATTGCTGGCCGCGATGGACGAGAACACCAGGC 
TCGGCAAGCTGCTGCTCGGCCAGGGCGTGTCGCGCAAGGCGCTGGAGAATGCCGTGGCCAACCTGCGTGGCGGCGAAGCG 
GTGAACGACCCGAACGTCGAGGAGTCGCGCCAGGCGCTGGACAAGTACACCGTCGACATGACCAAGCGCGCCGAGGAAGG 
CAAGCTCGACCCGGTGATCGGTCGCGACGACGAGATCCGCCGGACCATCCAGGTCCTGCAGCGGCGGACCAAGAACAACC 
CGGTGCTGATCGGCGAACCCGGCGTCGGCAAGACCGCCATCGTCGAGGGCCTGGCCCAGCGCATCATCAACGGCGAAGTG 
CCGGACGGCCTCAAGGACAAGCGCCTGCTGGCCCTGGACATGGGGGCGCTGATCGCCGGTGCCAAGTTCCGCGGCGAGTT 
CGAGGAACGCCTGAAGGCGGTCCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTCATCCTGTTCATCGACGAACTGCACA 
CCATGGTCGGCGCCGGCAAGGCGGAAGGTGCCATGGACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTG
CACTGCGTCGGTGCTACTACCCTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAA 
GGTGCTGGTGGACGAACCGAGCGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTATGA 

>ORF25110 (SEQ ID NO:294)

CGTTCCTTGAGGCCACGGAGGATGGCGATGGTGTCTTCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTC 
CAGCGCGGCATCCTTCTCGATGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCA 
GAGCCGGCTTGAGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTCGATG 
AACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTT 
GGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGA 
TGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGC 
CGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTT 
GGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCA 
GGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCC 
ATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTGGTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAG 
GCGTGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGC 
CGCTGCGCAGGGCGGCGATATCGAAGCCGACCTGCATCAGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCG 
GAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCATGGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAG 
CTTGCTGGTCAAACGGTCTATTCGCATGGGTCGTCCTTCCTTCTATAGAGCGGGCCGGAACGATGGGTGTCCCTGA 

>ORF25510 (SEQ ID NO:296) 

TGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGC 
CGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTT 
GGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCA 
GGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCC 
ATCGCGGCCAGCAATACCAGCTCGCTGGAGATGA
>ORF26762c (SEQ ID NO:298)

CCGCCGACTGCCTGCCGGCGTTGTTCTGCGACCGCTCGGGCACCCGGGTGGCCGCGGCCCATGCCGGCTGGCGCGGGCTG
GCGGCGGGCGTGCTGGAGGCGACGGTGGACAGCCTGGGCGTGCCCGGCGACGAACTGCTGGTCTGGCTGGGGCCGGCGAT 
CGGCCCGCAGGCCTTCGAGGTCGGCGGCGAGGTCCGCGATGCATTCGTCGCTGCGCACGCCGAGGCGCGCTCGGCTTTCG 
TACCTAGCGCCAATCCGGGCCGCTTCATGGCCGACATCTACCGACTCGCGCGGATCCGCCTGGGCGCCCATGGCGTCACC
GCCGTGCATGGCGGCGGCTTCTGCACCTTCAGCGATACCGCGCGCTTCTATTCCTACCGCCGCTCGTCGCGTACCGGCCG 
TTTTGCCAGCCTGGTCTGGCTCCAGGACTAGGCCCGCGCAGGTTATCCGGCGGCAACTGACCGATGTCACGGTCCGGTCG 
CTTGAACCGCGGAAAATCGCCCTTATCTACTGA 

>ORF26257 (SEQ ID NO:300)

ATAAGGGCGATTTTCCGCGGTTCAAGCGACCGGACCGTGACATCGGTCAGTTGCCGCCGGATAACCTGCGCGGGCCTAGT 
CCTGGAGCCAGACCAGGCTGGCAAAACGGCCGGTACGCGACGAGCGGCGGTAGGAATAGAAGCGCGCGGTATCGCTGAAG 
GTGCAGAAGCCGCCGCCATGCACGGCGGTGACGCCATGGGCGCCCAGGCGGATCCGCGCGAGTCGGTAGATGTCGGCCAT 
GAAGCGGCCCGGATTGGCGCTAGGTACGAAAGCCGAGCGCGCCTCGGCGTGCGCAGCGACGAATGCATCGCGGACCTCGC 
CGCCGACCTCGAAGGCCTGCGGGCCGATCGCCGGCCCCAGCCAGACCAGCAGTTCGTCGCCGGGCACGCCCAGGCTGTCC 
ACCGTCGCCTCCAGCACGCCCGCCGCCAGCCCGCGCCAGCCGGCATGGGCCGCGGCCACCCGGGTGCCCGAGCGGTCGCA 
GAACAACGCCGGCAGGCAGTCGGCGGTCATGATCGTACAGGCGACGCCCGGCATCGCGCTCCAGCTGGCGTCGGCCCTGA
GCACCGGTTCGGGTCGGCCTCCACCACGTCACTCCGTGCACCTATTCCAACCAGCTCGGCCGGCATTCCAGACGCTCGGT 
CAGGCGTCGGCGGTTTTATTCCACGGCGCGCGGATCGTCGTAGACGTGGGCGCCAAGGTTCAGACTGTCGAAGGGTGCCT 
GGCTGA 

>ORF26844c (SEQ ID NO:302) 

CGTGGTGGAGGCCGACCCGAACCGGTGCTCAGGGCCGACGCCAGCTGGAGCGCGATGCCGGGCGTCGCCTGTACGATCAT

GACCGCCGACTGCCTGCCGGCGTTGTTCTGCGACCGCTCGGGCACCCGGGTGGCCGCGGCCCATGCCGGCTGGCGCGGGC

TGGCGGCGGGCGTGCTGGAGGCGACGGTGGACAGCCTGGGCGTGCCCGGCGACGAACTGCTGGTCTGGCTGGGGCCGGCG

ATCGGCCCGCAGGCCTTCGAGGTCGGCGGCGAGGTCCGCGATGCATTCGTCGCTGCGCACGCCGAGGCGCGCTCGGCTTT

CGTACCTAGCGCCAATCCGGGCCGCTTCATGGCCGACATCTACCGACTCGCGCGGATCCGCCTGGGCGCCCATGGCGTCA

CCGCCGTGCATGGCGGCGGCTTCTGCACCTTCAGCGATACCGCGCGCTTCTATTCCTACCGCCGCTCGTCGCGTACCGGC

CGTTTTGCCAGCCTGGTCTGGCTCCAGGACTAG

>ORF26486 (SEQ ID NO:304)

ATGTCGGCCATGAAGCGGCCCGGATTGGCGCTAGGTACGAAAGCCGAGCGCGCCTCGGCGTGCGCAGCGACGAATGCATC
GCGGACCTCGCCGCCGACCTCGAAGGCCTGCGGGCCGATCGCCGGCCCCAGCCAGACCAGCAGTTCGTCGCCGGGCACGC
CCAGGCTGTCCACCGTCGCCTCCAGCACGCCCGCCGCCAGCCCGCGCCAGCCGGCATGGGCCGCGGCCACCCGGGTGCCC
GAGCGGTCGCAGAACAACGCCGGCAGGCAGTCGGCGGTCATGATCGTACAGGCGACGCCCGGCATCGCGCTCCAGCTGGC
GTCGGCCCTGAGCACCGGTTCGGGTCGGCCTCCACCACGTCACTCCGTGCACCTATTCCAACCAGCTCGGCCGGCATTCC

AGACGCTCGGTCAGGCGTCGGCGGTTTTATTCCACGGCGCGCGGATCGTCGTAGACGTGGGCGCCAAGGTTCAGACTGTC
GAAGGGTGCCTGGCTGACCCCGCCACTGCGCGTGGTCACGCAGGCCCGCACACGGGCCGGCGCCGGCCAGTCGGGGGTCA
GCCAGGCGTTCAACCGACGAACGCCTCGCGATCCTGGCGCAACAGGCTGAGCAGCCAGAGGAATTCTTCCGGCAGCGGCG
ATTCCCACTTCATGCGCACGCCGGTGGCCGGGTGA

>ORF26857c (SEQ ID NO:306) 

GTGCACGGAGTGACGTGGTGGAGGCCGACCCGAACCGGTGCTCAGGGCCGACGCCAGCTGGAGCGCGATGCCGGGCGTCG 
CCTGTACGATCATGACCGCCGACTGCCTGCCGGCGTTGTTCTGCGACCGCTCGGGCACCCGGGTGGCCGCGGCCCATGCC 
GGCTGGCGCGGGCTGGCGGCGGGCGTGCTGGAGGCGACGGTGGACAGCCTGGGCGTGCCCGGCGACGAACTGCTGGTCTG 
GCTGGGGCCGGCGATCGGCCCGCAGGCCTTCGAGGTCGGCGGCGAGGTCCGCGATGCATTCGTCGCTGCGCACGCCGAGG 
CGCGCTCGGCTTTCGTACCTAG 

>ORF27314c (SEQ ID NO:308) 

AGTGGGAATCGCCGCTGCCGGAAGAATTCCTCTGGCTGCTCAGCCTGTTGCGCCAGGATCGCGAGGCGTTCGTCGGTTGA 
ACGCCTGGCTGACCCCCGACTGGCCGGCGCCGGCCCGTGTGCGGGCCTGCGTGACCACGCGCAGTGGCGGGGTCAGCCAG 
GCACCCTTCGACAGTCTGAACCTTGGCGCCCACGTCTACGACGATCCGCGCGCCGTGGAATAAAACCGCCGACGCCTGAC 
CGAGCGTCTGGAATGCCGGCCGAGCTGGTTGGAATAGGTGCACGGAGTGACGTGGTGGAGGCCGACCCGAACCGGTGCTC 
AGGGCCGACGCCAGCTGGAGCGCGATGCCGGGCGTCGCCTGTACGATCATGA 
>ORF27730c (SEQ ID NO:310)

CAAGCCCGCCGGCCTGGTGGTCCATCCGGCTGCCGGCCATCAGGACGGCACCCTGCTGAATGCCTTGCTCTACCATGTCC
CGGACATCGCCAATGTGCCGCGCGCCGGGATCGTCCACCGCCTGGACAAGGACACGACCGGCCTGATGGTAGTGGCCAAG
ACGCTGGAGGCCCACACCAAGCTGGTGGCGCAACTGCAGGCACGGTCGGTCAGCCGCATCTACGAGGCGATCGTGATCGG

CGTGATCACCTCCGGCGGCACCATCGATGCGCCGATCGGACGGCATGGCGTGCAGCGGCAGAAGATGGCGGTGGTCGACG
CCGGCAAGGTGGCGGTCAGCCATTACCGCGTGCTGGAACGCTTCCGTGCGCACACCCATACCCGGGTCAAGCTGGAGACC

GGGCGTACCCACCAGATCCGCGTGCACATGAGCCATATTGGCTATCCCCTGGTCGGCGATCCGGTCTACGGTGGGCGCTT 
CAGGATTCCCCCGGTGGCCAGCCAGACCCTGGTCCAGACTCTTCGCGAATTCCCCCGGCAGGCGCTGCACGCGCGCTTCC 
TCGAACTGGATCACCCGGCCACCGGCGTGCGCATGAAGTGGGAATCGCCGCTGCCGGAAGAATTCCTCTGGCTGCTCAGC 
CTGTTGCGCCAGGATCGCGAGGCGTTCGTCGGTTGAACGCCTGGCTGACCCCCGACTGGCCGGCGCCGGCCCGTGTGCGG 
GCCTGCGTGACCACGCGCAGTGGCGGGGTCAGCCAGGCACCCTTCGACAGTCTGAACCTTGGCGCCCACGTCTACGACGA 
TCCGCGCGCCGTGGAATAA 

>ORF26983 (SEQ ID NO:312) 

CCCCGCCACTGCGCGTGGTCACGCAGGCCCGCACACGGGCCGGCGCCGGCCAGTCGGGGGTCAGCCAGGCGTTCAACCGA 
CGAACGCCTCGCGATCCTGGCGCAACAGGCTGAGCAGCCAGAGGAATTCTTCCGGCAGCGGCGATTCCCACTTCATGCGC

ACGCCGGTGGCCGGGTGATCCAGTTCGAGGAAGCGCGCGTGCAGCGCCTGCCGGGGGAATTCGCGAAGAGTCTGGACCAG 
GGTCTGGCTGGCCACCGGGGGAATCCTGAAGCGCCCACCGTAGACCGGATCGCCGACCAGGGGATAGCCAATATGGCTCA 
TGTGCACGCGGATCTGGTGGGTACGCCCGGTCTCCAGCTTGACCCGGGTATGGGTGTGCGCACGGAAGCGTTCCAGCACG 
CGGTAATGGCTGACCGCCACCTTGCCGGCGTCGACCACCGCCATCTTCTGCCGCTGCACGCCATGCCGTCCGATCGGCGC 
ATCGATGGTGCCGCCGGAGGTGATCACGCCGATCACGATCGCCTCGTAGATGCGGCTGACCGACCGTGCCTGCAGTTGCG 
CCACCAGCTTGGTGTGGGCCTCCAGCGTCTTGGCCACTACCATCAGGCCGGTCGTGTCCTTGTCCAGGCGGTGGACGATC 
CCGGCGCGCGGCACATTGGCGATGTCCGGGACATGGTAGAGCAAGGCATTCAGCAGGGTGCCGTCCTGATGGCCGGCAGC 
CGGATGGACCACCAGGCCGGCGGGCTTGTCAATCACCAGGATGTGCTCGTCCTCGTAGACGATTTCCAGCTCGATGTCCT 
GTGCGAGCCACTCGCCCTGGGCTTCCTGCTCGGCCTCCAGGACCAGTTGCGCGCCGCTGTGGACGATGTCGCGCGGGCGC 
AGCACGGCGCCGTCGACGGTCAGGCGACCGTCCTTGATCCAGCCGGCCAGACGGGAGCGGGAGTGTTCGGGAAAAAGCTG 
GGCGGCGATCTGGTCGAGACGCTGGCCACCCAGCTCGAACGGCACCTCGGCCGCGCGTTGAATCATATCGGACATGAGTA 
G 

>ORF28068c (SEQ ID NO:314) 

CCACAGCGCGTAGCCGATTCCAAAAGCCGCGCTGAGCATCGTCTCCTACTCATGTCCGATATGATTCAACGCGCGGCCGA 
GGTGCCGTTCGAGCTGGGTGGCCAGCGTCTCGACCAGATCGCCGCCCAGCTTTTTCCCGAACACTCCCGCTCCCGTCTGG 
CCGGCTGGATCAAGGACGGTCGCCTGACCGTCGACGGCGCCGTGCTGCGCCCGCGCGACATCGTCCACAGCGGCGCGCAA 
CTGGTCCTGGAGGCCGAGCAGGAAGCCCAGGGCGAGTGGCTCGCACAGGACATCGAGCTGGAAATCGTCTACGAGGACGA 
GCACATCCTGGTGATTGACAAGCCCGCCGGCCTGGTGGTCCATCCGGCTGCCGGCCATCAGGACGGCACCCTGCTGAATG
CCTTGCTCTACCATGTCCCGGACATCGCCAATGTGCCGCGCGCCGGGATCGTCCACCGCCTGGACAAGGACACGACCGGC 
CTGATGGTAGTGGCCAAGACGCTGGAGGCCCACACCAAGCTGGTGGCGCAACTGCAGGCACGGTCGGTCAGCCGCATCTA 
CGAGGCGATCGTGATCGGCGTGATCACCTCCGGCGGCACCATCGATGCGCCGATCGGACGGCATGGCGTGCAGCGGCAGA 
AGATGGCGGTGGTCGACGCCGGCAAGGTGGCGGTCAGCCATTACCGCGTGCTGGAACGCTTCCGTGCGCACACCCATACC 
CGGGTCAAGCTGGAGACCGGGCGTACCCACCAGATCCGCGTGCACATGAGCCATATTGGCTATCCCCTGGTCGGCGATCC 
GGTCTACGGTGGGCGCTTCAGGATTCCCCCGGTGGCCAGCCAGACCCTGGTCCAGACTCTTCGCGAATTCCCCCGGCAGG 
CGCTGCACGCGCGCTTCCTCGAACTGGATCACCCGGCCACCGGCGTGCGCATGAAGTGGGAATCGCCGCTGCCGGAAGAA 
TTCCTCTGGCTGCTCAGCCTGTTGCGCCAGGATCGCGAGGCGTTCGTCGGTTGA 

>ORF27522 (SEQ ID NO:316)

CCGACCGTGCCTGCAGTTGCGCCACCAGCTTGGTGTGGGCCTCCAGCGTCTTGGCCACTACCATCAGGCCGGTCGTGTCC 
TTGTCCAGGCGGTGGACGATCCCGGCGCGCGGCACATTGGCGATGTCCGGGACATGGTAGAGCAAGGCATTCAGCAGGGT 
GCCGTCCTGATGGCCGGCAGCCGGATGGACCACCAGGCCGGCGGGCTTGTCAATCACCAGGATGTGCTCGTCCTCGTAGA 
CGATTTCCAGCTCGATGTCCTGTGCGAGCCACTCGCCCTGGGCTTCCTGCTCGGCCTCCAGGACCAGTTGCGCGCCGCTG 
TGGACGATGTCGCGCGGGCGCAGCACGGCGCCGTCGACGGTCAGGCGACCGTCCTTGA 
>ORF28033c (SEQ ID NO:318)

GCATCGTCTCCTACTCATGTCCGATATGATTCAACGCGCGGCCGAGGTGCCGTTCGAGCTGGGTGGCCAGCGTCTCGACC
AGATCGCCGCCCAGCTTTTTCCCGAACACTCCCGCTCCCGTCTGGCCGGCTGGATCAAGGACGGTCGCCTGACCGTCGAC 
GGCGCCGTGCTGCGCCCGCGCGACATCGTCCACAGCGGCGCGCAACTGGTCCTGGAGGCCGAGCAGGAAGCCCAGGGCGA
GTGGCTCGCACAGGACATCGAGCTGGAAATCGTCTACGAGGACGAGCACATCCTGGTGATTGA 

>ORF29701c (SEQ ID NO:320) 

TCTTCCAGTTCGCTGGAGATCAGCAGGACCAGTACCAGGCCGATGGTCAGGCGGTACAGGTGGTACAGACGGAGGATGCG 
TTGCCCCTGCTCCTCGCTCAGCCGTAGCCGTTCAGCGCGCACGGTCGCCCTGGTCCTGGCGCAGGTGCGCCTGGCTGCAA 
TACCAGCGTTGTTCGTGGGCGAGGGCGTTGGCCTGCGGCACGTGGACGCCGCAATGGGCGCAGCGGACCATCGGCGATGC 
GCTCGGCTCGTCCTGCGGACGTTGCTGCTGGCGCGGAGTGGGACGGGTAAAGCGACGCCAGAGCCAGAACGCGATGGCGA 
TCAGGGCGATCCAGAACAGGAGGCGGAAAAGGCCCATGGTGATCTCGGAGGCTGGAGAAAGCTGCAGTTTAGCCAAGCCG 
CCGGCTCGATCCCAGACGGGAAGGTCCAGGCTGTGCGGCGTTTGGCGCTGGGAGAGGCATGGCGGCGGGCAAAAAGAAGG 
GAGGCCTGCGCCTCCCTTCGGTGTTTCGTGCGATCAGTCGAAGAGACCGAAGGTCATGTAGCTCCACCAGGAGCGACCGG 
AGTCCTCGTCGTCATCGCTCTCCGGCTTCTCGTCGTCGGCGCTGTGATCCTGGTTTTCCGGCTTCAGTTCGGCGGGGATC 
TCCCGCTCGGCATCCTCGTACTGCTTGATCACGTCCTTGGCGGCCTGGGTTTCCATGTGCGGCGGCGGCTCGCCGCCTTC 
GATCAGGCCCAGGGTGGCCTTGGCCAGCCAGGAGCGGGTGTCGGCCTCGCTTTCGCGGGCGACGAACTCGCCATCCTTGA 
GGCTGGCGTTATCCGGATAGTTCAGCTTGAGGGTTTCCAGGCTGGTGCTGGCCAGGTCGTCGAGACCCAGGCGACGGTAG 
GCTTCGACCATGATCGCCAGGCCATCGCCGACGGCCGGGGTTTCCTGGAAGTTCTCCACCACGTAGCGACCGCGGTTGGC 
GGCGGCGACATAGGCCTGGCGCTTCAGGTAGTAGTGGCCGACGTGCACTTCGTAGGCCGCCAGCAGGTTGCGCAGGTACA

CCATGCGCGCCTTGGCGTCCGGGGCGTAGCGGCTGTTGGGGAAGCGGCTGGTGAGCTGGGCGAACTCGTTGAAGGAGTCG 
CGGGCGGCGCCCGGGTCGCGCTTGGTCATGTCCAGCGGCAGGAAGCGCGCCAGCAGGCCGCGGTCCTGGTCGAAGGAGGA 
CAGGCCTTTGAGGTAGTAGGCGTAGTCGACGTTGGGGTGCTGCGGATGCAGGCGGATGAAGCGTTCGGCGGCGGCGCGGG 
CGGCTTCGGGCTCCATGTTCTTGTAGTTGGCGTAGATCAGCTCGAGCTGGGCCTGCTCGGCGTAGCGGCCGAAGGGATAG 
CGCGATTCGAGGGCTTTCAGCTTGGTGACGGCGCTGTTGTAGCTCTTGTTGTTGAGGTCGTCCTGCGCCTGCTGGTACAG 
CTGGCTCTCGCTCAGGTTCTCGTCGACAGTCTCCTTGTTCGAGGAGCAGGCTGCGGTGAGGGCGAGGATGGCGATCAGCA 
GCAGGTGTTTCACTTGCATGGCGGCTTGCGTCCCTGGGACGGTCGGCTTGGCCTCAACCGTCTGTTATGA 

>ORF28118 (SEQ ID NO:322) 

CAGACGGTTGAGGCCAAGCCGACCGTCCCAGGGACGCAAGCCGCCATGCAAGTGAAACACCTGCTGCTGATCGCCATCCT 

CGCCCTCACCGCAGCCTGCTCCTCGAACAAGGAGACTGTCGACGAGAACCTGAGCGAGAGCCAGCTGTACCAGCAGGCGC 

AGGACGACCTCAACAACAAGAGCTACAACAGCGCCGTCACCAAGCTGAAAGCCCTCGAATCGCGCTATCCCTTCGGCCGC 

TACGCCGAGCAGGCCCAGCTCGAGCTGATCTACGCCAACTACAAGAACATGGAGCCCGAAGCCGCCCGCGCCGCCGCCGA 

ACGCTTCATCCGCCTGCATCCGCAGCACCCCAACGTCGACTACGCCTACTACCTCAAAGGCCTGTCCTCCTTCGACCAGG 

ACCGCGGCCTGCTGGCGCGCTTCCTGCCGCTGGACATGACCAAGCGCGACCCGGGCGCCGCCCGCGACTCCTTCAACGAG 

TTCGCCCAGCTCACCAGCCGCTTCCCCAACAGCCGCTACGCCCCGGACGCCAAGGCGCGCATGGTGTACCTGCGCAACCT

GCTGGCGGCCTACGAAGTGCACGTCGGCCACTACTACCTGAAGCGCCAGGCCTATGTCGCCGCCGCCAACCGCGGTCGCT 

ACGTGGTGGAGAACTTCCAGGAAACCCCGGCCGTCGGCGATGGCCTGGCGATCATGGTCGAAGCCTACCGTCGCCTGGGT 
CTCGACGACCTGGCCAGCACCAGCCTGGAAACCCTCAAGCTGAACTATCCGGATAACGCCAGCCTCAAGGATGGCGAGTT

CGTCGCCCGCGAAAGCGAGGCCGACACCCGCTCCTGGCTGGCCAAGGCCACCCTGGGCCTGATCGAAGGCGGCGAGCCGC 
CGCCGCACATGGAAACCCAGGCCGCCAAGGACGTGATCAAGCAGTACGAGGATGCCGAGCGGGAGATCCCCGCCGAACTG 
AAGCCGGAAAACCAGGATCACAGCGCCGACGACGAGAAGCCGGAGAGCGATGACGACGAGGACTCCGGTCGCTCCTGGTG 
GAGCTACATGACCTTCGGTCTCTTCGACTGA 

>ORF28129 (SEQ ID NO:324) 

GGCCAAGCCGACCGTCCCAGGGACGCAAGCCGCCATGCAAGTGAAACACCTGCTGCTGATCGCCATCCTCGCCCTCACCG 
CAGCCTGCTCCTCGAACAAGGAGACTGTCGACGAGAACCTGAGCGAGAGCCAGCTGTACCAGCAGGCGCAGGACGACCTC 
AACAACAAGAGCTACAACAGCGCCGTCACCAAGCTGAAAGCCCTCGAATCGCGCTATCCCTTCGGCCGCTACGCCGAGCA 
GGCCCAGCTCGAGCTGATCTACGCCAACTACAAGAACATGGAGCCCGAAGCCGCCCGCGCCGCCGCCGAACGCTTCATCC 
GCCTGCATCCGCAGCACCCCAACGTCGACTACGCCTACTACCTCAAAGGCCTGTCCTCCTTCGACCAGGACCGCGGCCTG 
CTGGCGCGCTTCCTGCCGCTGGACATGACCAAGCGCGACCCGGGCGCCGCCCGCGACTCCTTCAACGAGTTCGCCCAGCT 
CACCAGCCGCTTCCCCAACAGCCGCTACGCCCCGGACGCCAAGGCGCGCATGGTGTACCTGCGCAACCTGCTGGCGGCCT 
ACGAAGTGCACGTCGGCCACTACTACCTGAAGCGCCAGGCCTATGTCGCCGCCGCCAACCGCGGTCGCTACGTGGTGGAG 
AACTTCCAGGAAACCCCGGCCGTCGGCGATGGCCTGGCGATCATGGTCGAAGCCTACCGTCGCCTGGGTCTCGACGACCT 
GGCCAGCACCAGCCTGGAAACCCTCAAGCTGAACTATCCGGATAA 
>ORF29709c (SEQ ID NO:326)

GGACCTGATCTTCCAGTTCGCTGGAGATCAGCAGGACCAGTACCAGGCCGATGGTCAGGCGGTACAGGTGGTACAGACGG
AGGATGCGTTGCCCCTGCTCCTCGCTCAGCCGTAGCCGTTCAGCGCGCACGGTCGCCCTGGTCCTGGCGCAGGTGCGCCT 
GGCTGCAATACCAGCGTTGTTCGTGGGCGAGGGCGTTGGCCTGCGGCACGTGGACGCCGCAATGGGCGCAGCGGACCATC 
GGCGATGCGCTCGGCTCGTCCTGCGGACGTTGCTGCTGGCGCGGAGTGGGACGGGTAAAGCGACGCCAGAGCCAGAACGC 
GATGGCGATCAGGGCGATCCAGAACAGGAGGCGGAAAAGGCCCATGGTGATCTCGGAGGCTGGAGAAAGCTGCAGTTTAG 
CCAAGCCGCCGGCTCGATCCCAGACGGGAAGGTCCAGGCTGTGCGGCGTTTGGCGCTGGGAGAGGCATGGCGGCGGGCAA

AAAGAAGGGAGGCCTGCGCCTCCCTTCGGTGTTTCGTGCGATCAGTCGAAGAGACCGAAGGTCATGTAG 
>ORF29189 (SEQ ID NO:328) 

TCGCACGAAACACCGAAGGGAGGCGCAGGCCTCCCTTCTTTTTGCCCGCCGCCATGCCTCTCCCAGCGCCAAACGCCGCA 
CAGCCTGGACCTTCCCGTCTGGGATCGAGCCGGCGGCTTGGCTAAACTGCAGCTTTCTCCAGCCTCCGAGATCACCATGG 
GCCTTTTCCGCCTCCTGTTCTGGATCGCCCTGATCGCCATCGCGTTCTGGCTCTGGCGTCGCTTTACCCGTCCCACTCCG 
CGCCAGCAGCAACGTCCGCAGGACGAGCCGAGCGCATCGCCGATGGTCCGCTGCGCCCATTGCGGCGTCCACGTGCCGCA
GGCCAACGCCCTCGCCCACGAACAACGCTGGTATTGCAGCCAGGCGCACCTGCGCCAGGACCAGGGCGACCGTGCGCGCT 
GA 

>ORF29382 (SEQ ID NO:330) 

TCGCCATCGCGTTCTGGCTCTGGCGTCGCTTTACCCGTCCCACTCCGCGCCAGCAGCAACGTCCGCAGGACGAGCCGAGC 
GCATCGCCGATGGTCCGCTGCGCCCATTGCGGCGTCCACGTGCCGCAGGCCAACGCCCTCGCCCACGAACAACGCTGGTA 
TTGCAGCCAGGCGCACCTGCGCCAGGACCAGGGCGACCGTGCGCGCTGAACGGCTACGGCTGAGCGAGGAGCAGGGGCAA 
CGCATCCTCCGTCTGTACCACCTGTACCGCCTGACCATCGGCCTGGTACTGGTCCTGCTGATCTCCAGCGAACTGGAAGA 
TCAGGTCCTCAAGCTCGTCCACCCTGAACTGTTCCATGTCGGCAGTTGGTGCTACCTGGTCTTCAACATCCTGGTCGCGC 
TGTTCCTGCCGCCGTCGCGGCAATTGCTGCCGATCTTCATCCTCGCGCTCACCGACGTGCTGATGCTTTGCGGCCTGTTC 
TACGCAGGTGGCGGCGTACCCAGCGGCATCGGCAGCCTGCTGGTGGTGGCGGTGGCCATTGCCAACATCCTGCTGCGCGG 
GCGCATCGGCCTGGTCATCGCGGCGGCGGCCAGCCTCGGCCTGCTCTACCTGACCTTCTTCCTCAGCCTGAGCAGTCCGG

ACGCCACCAACCACTACGTCCAGGCCGGCGGCCTCGGCACCCTGTGCTTCGCCGCCGCGCTGGTGATCCAGGCTCTGGTG 
CGGCGCCAGGAGCAGACCGAAACGCTGGCCGAAGAACGCGCCGAGACGGTCGCCAACCTGGAGGAACTCAACGCATTGAT 
CCTGCAGCGCATGCGCACCGGCATCCTCGTGGTCGATAGCCGTCAGGCCATCCTCCTCGCCAACCAGGCCGCCCTCGGCC
TGCTCAGGCAGGACGACGTGCAGGGCGCCAGCCTCGGCCGCCACAGCCCGATGCTGATGCACTGCATGAAGCAATGGCGC 
CTGAATCCCAGCCTCCGTCCGCCGACGCTCAAGGTGGTGCCGGATGGCCCGACGGTGCAACCCAGCTTTATCAGCCTCAA 
CCGCGAAGACGACCAGCACGTGCTGATCTTCCTCGAAGACATTTCGCAGATCGCCCAGCAGGCGCAGCAGATGAAGCTGG
CCGGTCTTGGCCGCCTGACCGCCGGCATCGCCCATGAGATCCGCAACCCGCTGGGCGCGATCAGCCACGCCGCCCAACTG 
CTGCAGGAGTCAGAGGAACTGGATGCCCCGGACCGACGCCTGACGCAGATCATCCAGGACCAGTCGAAGCGGATGAACCT 
GGTCATCGAGAACGTCCTGCAGCTCTCCCGTCGCCGCCAGGCCGAACCGCAGCAGCTCGACCTGAAGGAGTGGCTTCAGC 
GGTTCGTCGACGAATACCCCGGCAGGCTGCGCAACGACAGCCAACTGCACCTGCAGCTCGGTGCCGGCGACATCCAGACC

CGCATGGACCCACACCAGTTGAACCAGGTGCTGAGCAACCTGGTGCAGAACGGTCTTCGCTACAGCGCCCAGGCGCACGG 
GCGCGGCCAGGTCTGGCTGAGCCTCGCGCGCGACCCGGAGAGCGACCTGCCGGTGCTGGAAGTCATCGACGACGGTCCCG 
GCGTACCGGCGGACAAACTGAACAACCTGTTCGAACCCTTCTTTACTACAGAAAGCAAAGGCACCGGCCTGGGCCTCTAT

CTCTCCCGCGAACTCTGCGAGAGCAACCAGGCACGGATCGACTACCGCAATCGCGAGGAAGGCGGCGGCTGCTTCCGCAT 
CACCTTCGCCCACCCGCGCAAACTCAGCTGA 
>ORF30590c (SEQ ID NO:332)

CTCCTGCAGCAGTTGGGCGGCGTGGCTGATCGCGCCCAGCGGGTTGCGGATCTCATGGGCGATGCCGGCGGTCAGGCGGC
CAAGACCGGCCAGCTTCATCTGCTGCGCCTGCTGGGCGATCTGCGAAATGTCTTCGAGGAAGATCAGCACGTGCTGGTCG 
TCTTCGCGGTTGAGGCTGATAAAGCTGGGTTGCACCGTCGGGCCATCCGGCACCACCTTGAGCGTCGGCGGACGGAGGCT 
GGGATTCAGGCGCCATTGCTTCATGCAGTGCATCAGCATCGGGCTGTGGCGGCCGAGGCTGGCGCCCTGCACGTCGTCCT 
GCCTGAGCAGGCCGAGGGCGGCCTGGTTGGCGAGGAGGATGGCCTGACGGCTATCGACCACGAGGATGCCGGTGCGCATG 
CGCTGCAGGATCAATGCGTTGAGTTCCTCCAGGTTGGCGACCGTCTCGGCGCGTTCTTCGGCCAGCGTTTCGGTCTGCTC 
CTGGCGCCGCACCAGAGCCTGGATCACCAGCGCGGCGGCGAAGCACAGGGTGCCGAGGCCGCCGGCCTGGACGTAGTGGT
TGGTGGCGTCCGGACTGCTCAGGCTGAGGAAGAAGGTCAGGTAGAGCAGGCCGAGGCTGGCCGCCGCCGCGATGACCAGG 
CCGATGCGCCCGCGCAGCAGGATGTTGGCAATGGCCACCGCCACCACCAGCAGGCTGCCGATGCCGCTGGGTACGCCGCC 
ACCTGCGTAGAACAGGCCGCAAAGCATCAGCACGTCGGTGAGCGCGAGGATGAAGATCGGCAGCAATTGCCGCGACGGCG 
GCAGGAACAGCGCGACCAGGATGTTGAAGACCAGGTAGCACCAACTGCCGACATGGAACAGTTCAGGGTGGACGAGCTTG 
AGGACCTGATCTTCCAGTTCGCTGGAGATCAGCAGGACCAGTACCAGGCCGATGGTCAGGCGGTACAGGTGGTACAGACG 
GAGGATGCGTTGCCCCTGCTCCTCGCTCAGCCGTAG 

>ORF29729 (SEQ ID NO:334) 
ACTGTTCCATGTCGGCAGTTGGTGCTACCTGGTCTTCAACATCCTGGTCGCGCTGTTCCTGCCGCCGTCGCGGCAATTGC

TGCCGATCTTCATCCTCGCGCTCACCGACGTGCTGATGCTTTGCGGCCTGTTCTACGCAGGTGGCGGCGTACCCAGCGGC 
ATCGGCAGCCTGCTGGTGGTGGCGGTGGCCATTGCCAACATCCTGCTGCGCGGGCGCATCGGCCTGGTCATCGCGGCGGC

GGCCAGCCTCGGCCTGCTCTACCTGACCTTCTTCCTCAGCCTGAGCAGTCCGGACGCCACCAACCACTACGTCCAGGCCG 

GCCGAAGAACGCGCCGAGACGGTCGCCAACCTGGAGGAACTCAACGCATTGATCCTGCAGCGCATGCGCACCGGCATCCT 
CGTGGTCGATAG 

>ORF30221 (SEQ ID NO:336) 

CCGTCAGGCCATCCTCCTCGCCAACCAGGCCGCCCTCGGCCTGCTCAGGCAGGACGACGTGCAGGGCGCCAGCCTCGGCC 
GCCACAGCCCGATGCTGATGCACTGCATGAAGCAATGGCGCCTGAATCCCAGCCTCCGTCCGCCGACGCTCAAGGTGGTG 
CCGGATGGCCCGACGGTGCAACCCAGCTTTATCAGCCTCAACCGCGAAGACGACCAGCACGTGCTGATCTTCCTCGAAGA 
CATTTCGCAGATCGCCCAGCAGGCGCAGCAGATGAAGCTGGCCGGTCTTGGCCGCCTGACCGCCGGCATCGCCCATGA

>ORF30736c (SEQ ID NO:338) 

AGCCACTCCTTCAGGTCGAGCTGCTGCGGTTCGGCCTGGCGGCGACGGGAGAGCTGCAGGACGTTCTCGATGACCAGGTT 
CATCCGCTTCGACTGGTCCTGGATGATCTGCGTCAGGCGTCGGTCCGGGGCATCCAGTTCCTCTGACTCCTGCAGCAGTT 
GGGCGGCGTGGCTGATCGCGCCCAGCGGGTTGCGGATCTCATGGGCGATGCCGGCGGTCAGGCGGCCAAGACCGGCCAGC 
TTCATCTGCTGCGCCTGCTGGGCGATCTGCGAAATGTCTTCGAGGAAGATCAGCACGTGCTGGTCGTCTTCGCGGTTGAG 
GCTGATAAAGCTGGGTTGCACCGTCGGGCCATCCGGCACCACCTTGAGCGTCGGCGGACGGAGGCTGGGATTCAGGCGCC 
ATTGCTTCATGCAGTGCATCAGCATCGGGCTGTGGCGGCCGAGGCTGGCGCCCTGCACGTCGTCCTGCCTGAGCAGGCCG 
AGGGCGGCCTGGTTGGCGAGGAGGATGGCCTGA 

>ORF30539 (SEQ ID NO:340) 

GATCCGCAACCCGCTGGGCGCGATCAGCCACGCCGCCCAACTGCTGCAGGAGTCAGAGGAACTGGATGCCCCGGACCGAC 
GCCTGACGCAGATCATCCAGGACCAGTCGAAGCGGATGAACCTGGTCATCGAGAACGTCCTGCAGCTCTCCCGTCGCCGC 
CAGGCCGAACCGCAGCAGCTCGACCTGAAGGAGTGGCTTCAGCGGTTCGTCGACGAATACCCCGGCAGGCTGCGCAACGA 
CAGCCAACTGCACCTGCAGCTCGGTGCCGGCGACATCCAGACCCGCATGGACCCACACCAGTTGAACCAGGTGCTGAGCA 
ACCTGGTGCAGAACGGTCTTCGCTACAGCGCCCAGGCGCACGGGCGCGGCCAGGTCTGGCTGAGCCTCGCGCGCGACCCG 
GAGAGCGACCTGCCGGTGCTGGAAGTCATCGACGACGGTCCCGGCGTACCGGCGGACAAACTGAACAACCTGTTCGAACC 
CTTCTTTACTACAGAAAGCAAAGGCACCGGCCTGGGCCTCTATCTCTCCCGCGAACTCTGCGAGAGCAACCAGGCACGGA 
TCGACTACCGCAATCGCGAGGAAGGCGGCGGCTGCTTCCGCATCACCTTCGCCCACCCGCGCAAACTCAGCTGACGGAAG

CCGCACGCATGAGCCGACAAAAAGCCCTGATCGTCGACGATGAACCGGATATCCGCGAACTGCTGGAAATCACTCTCGGC 
CGCATGAAGCTGGACACCCGCAGCGCCCGCAACGTCAAGGAAGCCGCGAGTTGCTGGCCCGCGAGCCGTTCGACCTGTGC 
CTCACCGACATGCGCCTGCCGGACGGCAGCGGCCTCGATCTGGTCCAGTACATCCAGCAGCGCCATCCACAGACCCCGGT 
GGCCATGA 
>ORF31247c (SEQ ID NO:342)

TTTCCAGCAGTTCGCGGATATCCGGTTCATCGTCGACGATCAGGGCTTTTTGTCGGCTCATGCGTGCGGCTTCCGTCAGC
TGAGTTTGCGCGGGTGGGCGAAGGTGATGCGGAAGCAGCCGCCGCCTTCCTCGCGATTGCGGTAGTCGATCCGTGCCTGG 
TTGCTCTCGCAGAGTTCGCGGGAGAGATAGAGGCCCAGGCCGGTGCCTTTGCTTTCTGTAGTAAAGAAGGGTTCGAACAG 
GTTGTTCAGTTTGTCCGCCGGTACGCCGGGACCGTCGTCGATGACTTCCAGCACCGGCAGGTCGCTCTCCGGGTCGCGCG 
CGAGGCTCAGCCAGACCTGGCCGCGCCCGTGCGCCTGGGCGCTGTAGCGAAGACCGTTCTGCACCAGGTTGCTCAGCACC 
TGGTTCAACTGGTGTGGGTCCATGCGGGTCTGGATGTCGCCGGCACCGAGCTGCAGGTGCAGTTGGCTGTCGTTGCGCAG
CCTGCCGGGGTATTCGTCGACGAACCGCTGAAGCCACTCCTTCAGGTCGAGCTGCTGCGGTTCGGCCTGGCGGCGACGGG
AGAGCTGCAGGACGTTCTCGATGACCAGGTTCATCCGCTTCGACTGGTCCTGGATGATCTGCGTCAGGCGTCGGTCCGGG 
GCATCCAGTTCCTCTGA 

>ORF30963c (SEQ ID NO:344) 

CTTCCAGCACCGGCAGGTCGCTCTCCGGGTCGCGCGCGAGGCTCAGCCAGACCTGGCCGCGCCCGTGCGCCTGGGCGCTG 
TAGCGAAGACCGTTCTGCACCAGGTTGCTCAGCACCTGGTTCAACTGGTGTGGGTCCATGCGGGTCTGGATGTCGCCGGC 
ACCGAGCTGCAGGTGCAGTTGGCTGTCGTTGCGCAGCCTGCCGGGGTATTCGTCGACGAACCGCTGAAGCCACTCCTTCA 
GGTCGAGCTGCTGCGGTTCGGCCTGGCGGCGACGGGAGAGCTGCAGGACGTTCTCGATGA 

>ORF31539c (SEQ ID NO:346)

GGCGGTTGCCACCAGCTCCCGCAAGCGACCGAGGTCGACCGGTTTGGTGAGGAAGTCGAAGGCACCGGCCTTGAGCGCCT 

GGATCGCGGTGTCCAGGCTGCCGTACGCGGTGATCATGGCCACCGGGGTCTGTGGATGGCGCTGCTGGATGTACTGGACC

AGATCGAGGCCGCTGCCGTCCGGCAGGCGCATGTCGGTGAGGCACAGGTCGAACGGCTCGCGGGCCAGCAACTCGCGGCT 

TCCTTGACGTTGCGGGCGCTGCGGGTGTCCAGCTTCATGCGGCCGAGAGTGATTTCCAGCAGTTCGCGGATATCCGGTTC 

ATCGTCGACGATCAGGGCTTTTTGTCGGCTCATGCGTGCGGCTTCCGTCAGCTGA 

>ORF31222 (SEQ ID NO:348) 
ACCGGATATCCGCGAACTGCTGGAAATCACTCTCGGCCGCATGAAGCTGGACACCCGCAGCGCCCGCAACGTCAAGGAAG

CCGCGAGTTGCTGGCCCGCGAGCCGTTCGACCTGTGCCTCACCGACATGCGCCTGCCGGACGGCAGCGGCCTCGATCTGG 
TCCAGTACATCCAGCAGCGCCATCCACAGACCCCGGTGGCCATGATCACCGCGTACGGCAGCCTGGACACCGCGATCCAG 
GCGCTCAAGGCCGGTGCCTTCGACTTCCTCACCAAACCGGTCGACCTCGGTCGCTTGCGGGAGCTGGTGGCAACCGCCCT 
ACGCTTGCGCAACCCGGAAGCCGAGGAAGCGCCGGTGGACAACCGCCTGCTCGGCGAGTCGCCGCCGATGCGCGCCCTGC
GCAACCAGATCGGCAAGCTGGCGCGCAGCCAGGCGCCGGTCTACATCAGTGGCGAGTCCGGCAGCGGCAAGGAACTGGTG 
GCGCGCCTGATCCACGAGCAGGGGCCACGTATCGAGCGGCCGTTCGTGCCGGTGAACTGCGGCGCGATTCCCTCCGAGCT 
GATGGAAAGCGAGTTCTTCGGCCACAAGAAAGGCAGCTTCACTGGCGCTATCGAAGACAAGCAGGGCCTGTTCCAGGCCG
CCAGCGGTGGCACCCTGTTCCTCGACGAAGTCGCCGACCTGCCGATGGCCATGCAGGTCAAACTGCTCCGGGCGATCCAG 
GAAAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGTCGCACGTGCGCATCCTCTGCGCCACCCACAAGGACC 
TCGCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTACTACCGCCTCAACGTCATCGAGCTGCGCGTACACCGC 
TGCGCGAACGCCGCGAGGACATCCCGCTGCTCGCCGAACGCATCCTCAAGCGCCTGGCCGGCGACACCGGCCTGCCGGCC 
GCCAGGCTGACCGGCGACGCACAGGAGAAGCTGAAGAACTACCGCTTCCCGGGCAACGTCCGCGAGCTGGAAAACATGCT 
GGAGCGCGCCTATACCCTGTGCGAAGACGACCAGATCCAGCCTCACGACCTGCGCCTGGCCGATGCGCCGGGTGCCAGCC 
AGGAAGGCGCCGCGAGCCTGAGCGAAATCGACAACCTCGAGGACTACCTGGAAGACATCGAGCGCAAGCTGATCATGCAG 
GCACTCGAGGAGACCCGCTGGAACCGCACCGCCGCGGCCCAGCGCCTGGGCCTGACGTTCCGCTCGATGCGCTACCGCCT 
GAAAAAGCTGGGCATCGACTGA 
>ORF31266 (SEQ ID NO:350)

AGCTGGACACCCGCAGCGCCCGCAACGTCAAGGAAGCCGCGAGTTGCTGGCCCGCGAGCCGTTCGACCTGTGCCTCACCG

ACATGCGCCTGCCGGACGGCAGCGGCCTCGATCTGGTCCAGTACATCCAGCAGCGCCATCCACAGACCCCGGTGGCCATG

ATCACCGCGTACGGCAGCCTGGACACCGCGATCCAGGCGCTCAAGGCCGGTGCCTTCGACTTCCTCACCAAACCGGTCGA 

CCTCGGTCGCTTGCGGGAGCTGGTGGCAACCGCCCTACGCTTGCGCAACCCGGAAGCCGAGGAAGCGCCGGTGGACAACC 

GCCTGCTCGGCGAGTCGCCGCCGATGCGCGCCCTGCGCAACCAGATCGGCAAGCTGGCGCGCAGCCAGGCGCCGGTCTAC 

ATCAGTGGCGAGTCCGGCAGCGGCAAGGAACTGGTGGCGCGCCTGATCCACGAGCAGGGGCCACGTATCGAGCGGCCGTT

CGTGCCGGTGAACTGCGGCGCGATTCCCTCCGAGCTGATGGAAAGCGAGTTCTTCGGCCACAAGAAAGGCAGCTTCACTG
GCGCTATCGAAGACAAGCAGGGCCTGTTCCAGGCCGCCAGCGGTGGCACCCTGTTCCTCGACGAAGTCGCCGACCTGCCG 
ATGGCCATGCAGGTCAAACTGCTCCGGGCGATCCAGGAAAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGT 
CGCACGTGCGCATCCTCTGCGCCACCCACAAGGACCTCGCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTAC 
TACCGCCTCAACGTCATCGAGCTGCGCGTACACCGCTGCGCGAACGCCGCGAGGACATCCCGCTGCTCGCCGAACGCATC 

CTTCCCGGGCAACGTCCGCGAGCTGGAAAACATGCTGGAGCGCGCCTATACCCTGTGCGAAGACGACCAGATCCAGCCTC 
ACGACCTGCGCCTGGCCGATGCGCCGGGTGCCAGCCAGGAAGGCGCCGCGAGCCTGAGCGAAATCGACAACCTCGAGGAC
TACCTGGAAGACATCGAGCGCAAGCTGATCATGCAGGCACTCGAGGAGACCCGCTGGAACCGCACCGCCGCGGCCCAGCG

CCTGGGCCTGACGTTCCGCTCGATGCGCTACCGCCTGAAAAAGCTGGGCATCGACTGAAAGTGAAAAGGCCTGTCCGAAG 
ACAGGCCTTTTGGTTTTCGCTCCTCAGAGGCGACCAGCCGGGGCGTAGGGGGCCGGGTCGATGA 

>ORF31661c (SEQ ID NO:352) 
ACCGGCGCCTGGCTGCGCGCCAGCTTGCCGATCTGGTTGCGCAGGGCGCGCATCGGCGGCGACTCGCCGAGCAGGCGGTT

GTCCACCGGCGCTTCCTCGGCTTCCGGGTTGCGCAAGCGTAGGGCGGTTGCCACCAGCTCCCGCAAGCGACCGAGGTCGA 
CCGGTTTGGTGAGGAAGTCGAAGGCACCGGCCTTGAGCGCCTGGATCGCGGTGTCCAGGCTGCCGTACGCGGTGATCATG 
GCCACCGGGGTCTGTGGATGGCGCTGCTGGATGTACTGGACCAGATCGAGGCCGCTGCCGTCCGGCAGGCGCATGTCGGT 
GAGGCACAGGTCGAACGGCTCGCGGGCCAGCAACTCGCGGCTTCCTTGA 

>ORF32061c (SEQ ID NO:354) 

AGGTCCTGGCGGAAGCGCCCGGCGCCGACTTCGGCGGCGAGGTCCTTGTGGGTGGCGCAGAGGATGCGCACGTGCGACGG 

CGACCTCCTGCTGGCCGCCGACCGCGCGCACGGCCTTTTCCTGGATCGCCCGGAGCAGTTTGACCTGCATGGCCATCGGC 

AGGTCGGCGACTTCGTCGAGGAACAGGGTGCCACCGCTGGCGGCCTGGAACAGGCCCTGCTTGTCTTCGATAGCGCCAGT 

GAAGCTGCCTTTCTTGTGGCCGAAGAACTCGCTTTCCATCAGCTCGGAGGGAATCGCGCCGCAGTTCACCGGCACGAACG 

GCCGCTCGATACGTGGCCCCTGCTCGTGGATCAGGCGCGCCACCAGTTCCTTGCCGCTGCCGGACTCGCCACTGATGTAG 

ACCGGCGCCTGGCTGCGCGCCAGCTTGCCGATCTGGTTGCGCAGGGCGCGCATCGGCGGCGACTCGCCGAGCAGGCGGTT

GTCCACCGGCGCTTCCTCGGCTTCCGGGTTGCGCAAGCGTAG 

>ORF32072c (SEQ ID NO:356) 

GGCGGTAGTAGAGGTCCTGGCGGAAGCGCCCGGCGCCGACTTCGGCGGCGAGGTCCTTGTGGGTGGCGCAGAGGATGCGC 
ACGTGCGACGGCGACCTCCTGCTGGCCGCCGACCGCGCGCACGGCCTTTTCCTGGATCGCCCGGAGCAGTTTGACCTGCA 
TGGCCATCGGCAGGTCGGCGACTTCGTCGAGGAACAGGGTGCCACCGCTGGCGGCCTGGAACAGGCCCTGCTTGTCTTCG 
ATAGCGCCAGTGAAGCTGCCTTTCTTGTGGCCGAAGAACTCGCTTTCCATCAGGTCGGAGGGAATCGCGCCGCAGTTCAC 
CGGCACGAACGGCCGCTCGATACGTGGCCCCTGCTCGTGGATCAGGCGCGCCACCAGTTCCTTGCCGCTGCCGGACTCGC 
CACTGATGTAG 

>ORF31784 (SEQ ID NO:358) 

TGGAAAGCGAGTTCTTCGGCCACAAGAAAGGCAGCTTCACTGGCGCTATCGAAGACAAGCAGGGCCTGTTCCAGGCCGCC 
AGCGGTGGCACCCTGTTCCTCGACGAAGTCGCCGACCTGCCGATGGCCATGCAGGTCAAACTGCTCCGGGCGATCCAGGA 
AAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGTCGCACGTGCGCATCCTCTGCGCCACCCACAAGGACCTC 
GCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTACTACCGCCTCAACGTCATCGAGCTGCGCGTACACCGCTG 
CGCGAACGCCGCGAGGACATCCCGCTGCTCGCCGAACGCATCCTCAAGCGCCTGGCCGGCGACACCGGCCTGCCGGCCGC 
CAGGCTGA 
>ORF32568c (SEQ ID NO:360)
GGAGCGAAAACCAAAAGGCCTGTCTTCGGACAGGCCTTTTCACTTTCAGTCGATGCCCAGCTTTTTCAGGCGGTAGCGCA

TCGAGCGGAACGTCAGGCCCAGGCGCTGGGCCGCGGCGGTGCGGTTCCAGCGGGTCTCCTCGAGTGCCTGCATGATCAGC 
TTGCGCTCGATGTCTTCCAGGTAGTCCTCGAGGTTGTCGATTTCGCTCAGGCTCGCGGCGCCTTCCTGGCTGGCACCCGG 
CGCATCGGCCAGGCGCAGGTCGTGAGGCTGGATCTGGTCGTCTTCGCACAGGGTATAGGCGCGCTCCAGCATGTTTTCCA
GCTCGCGGACGTTGCCCGGGAAGCGGTAGTTCTTCAGCTTCTCCTGTGCGTCGCCGGTCAGCCTGGCGGCCGGCAGGCCG 
GTGTCGCCGGCCAGGCGCTTGAGGATGCGTTCGGCGAGCAGCGGGATGTCCTCGCGGCGTTCGCGCAGCGGTGTACGCGC 
AGCTCGATGACGTTGAGGCGGTAG 

>ORF33157c (SEQ ID NO:362) 

acagacggaggtgcgcggctggttgcgcgacggcgatcgagtggtcggcgtggcgacctcgcgtggcgagatccgtggcg 
acaaggtgctgctggcggcaggcgcctggagcggcgagttgttgaagccgcttggcctggaactgcccgtggtaccggtg 
aaaggtcagatgatcctctacaagtgcgcggcggatttcctgccgcgcatggtgctggccaaggggcgctacgcgattcc

gcggcgcgacggccacatcctgatcggcagcaccttggaacattcgggcttcgacaagacgccgaccgacgaggcgctgg
aaagcctcagggcgtctgcggcagaactgttgccggaactggcggacatgcagccggtggcccactgggcagggttgcgc 
ccgggctctcccgaaggcatcccctatatcggtccggtgcctggcttcgacgggctctggctgaataccgggcactaccg 
caacgggctggtcctggcaccggcgtcgtgccgtctgctggcggatctcatgagcgggcgggaaccgatcatcgacccgg 
ccccctacgccccggctggtcgcctctgaggagcgaaaaccaaaaggcctgtcttcggacaggccttttcactttcagtc 
gatgcccagctttttcaggcggtagcgcatcgagcggaacgtcaggcccaggcgctgggccgcggcggtgcggttccagc
gggtctcctcgagtgcctgcatga 

>ORF32530 (SEQ ID NO:364)

aaaggcctgtccgaagacaggccttttggttttcgctcctcagaggcgaccagccggggcgtagggggccgggtcgatga 

tcggttcccgcccgctcatgagatccgccagcagacggcacgacgccggtgccaggaccagcccgttgcggtagtgcccg 

gtattcagccagagcccgtcgaagccaggcaccggaccgatataggggatgccttcgggagagcccgggcgcaaccctgc 

ccagtgggccaccggctgcatgtccgccagttccggcaacagttctgccgcagacgccctgaggctttccagcgcctcgt 

cggtcggcgtcttgtcgaagcccgaatgttccaaggtgctgccgatcaggatgtggccgtcgcgccgcggaatcgcgtag 

cgccccttggccagcaccatgcgcggcaggaaatccgccgcgcacttgtagaggatcatctgacctttcaccggtaccac

gggcagttccaggccaagcggcttcaacaactcgccgctccaggcgcctgccgccagcagcaccttgtcgccacggatct

cgccacgcgaggtcgccacgccgaccactcgatcgccgtcgcgcaaccagccgcgcacctccgtctgttcatgcaactcg 

agattggcgaattgttgcagggatgcccgcaatga 

>ORF33705c (SEQ ID NO:366) 

GTGATATTTCTCTGTTCCTGGCAAATCGGTAGGAGCCCTGTGGTGAGTAGAGATGTAGTAGTGGTAGGCGCTGGCGTCAT 

CGGCCTGTTGACCGCCCGGGAGCTGGCGCTCGCCGGACTGCGGGTGACCCTGGTGGAGCGGGGCGAGAGTGGGCGTGAGG 

CATCCTGGGCGGGAGGCGGGATCGTCTCGCCGCTCTATCCGTGGCGCTACAGCCCGGCGGTGACCGCCCTGGCGCACTGG 

TCGCAGGACTTCTACCCGGCCCTGGGGCAGCGTTTGCTCGACGAGACCGGGCTCGATCCCGAGGTCCATACCGTTGGCCT 

GTACTGGCTGGACCTGGACGACCAGACCGAGGCACTGCAGTGGGCACGCAACCACACCCGGCCGTTGAAGGAAGTGCCGA 

TCGAGGAGGCCTACGCGGCGGTGCCCGGGCTGGGCGCAGGCTTCCAGCGGGCGGTCTACATGTCGGGCGTGGCCAATGTG 

CGCAATCCTCGCCTGGCGCGCTCATTGCGGGCATCCCTGCAACAATTCGCCAATCTCGAGTTGCATGAACAGACGGAGGT 

GCGCGGCTGGTTGCGCGACGGCGATCGAGTGGTCGGCGTGGCGACCTCGCGTGGCGAGATCCGTGGCGACAAGGTGCTGC 

TGGCGGCAGGCGCCTGGAGCGGCGAGTTGTTGAAGCCGCTTGGCCTGGAACTGCCCGTGGTACCGGTGAAAGGTCAGATG 

ATCCTCTACAAGTGCGCGGCGGATTTCCTGCCGCGCATGGTGCTGGCCAAGGGGCGCTACGCGATTCCGCGGCGCGACGG 

CCACATCCTGATCGGCAGCACCTTGGAACATTCGGGCTTCGACAAGACGCCGACCGACGAGGCGCTGGAAAGCCTCAGGG 

CGTCTGCGGCAGAACTGTTGCCGGAACTGGCGGACATGCAGCCGGTGGCCCACTGGGCAGGGTTGCGCCCGGGCTCTCCC 

GAAGGCATCCCCTATATCGGTCCGGTGCCTGGCTTCGACGGGCTCTGGCTGAATACCGGGCACTACCGCAACGGGCTGGT 

CCTGGCACCGGCGTCGTGCCGTCTGCTGGCGGATCTCATGAGCGGGCGGGAACCGATCATCGACCCGGCCCCCTACGCCC

CGGCTGGTCGCCTCTGA 
>ORF32832 (SEQ ID NO:368)

GGCTTTCCAGCGCCTCGTCGGTCGGCGTCTTGTCGAAGCCCGAATGTTCCAAGGTGCTGCCGATCAGGATGTGGCCGTCG

cgccgcggaatcgcgtagcgccccttggccagcaccatgcgcggcaggaaatccgccgcgcacttgtagaggatcatctg
acctttcaccggtaccacgggcagttccaggccaagcggcttcaacaactcgccgctccaggcgcctgccgccagcagca 
ccttgtcgccacggatctcgccacgcgaggtcgccacgccgaccactcgatcgccgtcgcgcaaccagccgcgcacctcc 
gtctgttcatgcaactcgagattggcgaattgttgcagggatgcccgcaatgagcgcgccaggcgaggattgcgcacatt 
ggccacgcccgacatgtagaccgcccgctggaagcctgcgcccagcccgggcaccgccgcgtaggcctcctcgatcggca 

CTTCCTTCAACGGCCGGGTGTGGTTGCGTGCCCACTGCAGTGCCTCGGTCTGGTCGTCCAGGTCCAGCCAGTACAGGCCA 
ACGGTATGGACCTCGGGATCGAGCCCGGTCTCGTCGAGCAAACGCTGCCCCAGGGCCGGGTAGAAGTCCTGCGACCAGTG 
CGCCAGGGCGGTCACCGCCGGGCTGTAGCGCCACGGATAGAGCGGCGAGACGATCCCGCCTCCCGCCCAGGATGCCTCAC 
GCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCCGGCGAGCGCCAGCTCCCGGGCGGTCAACAGGCCGATGA 

>ORF33547c (SEQ ID NO:370) 

GGCATCCTGGGCGGGAGGCGGGATCGTCTCGCCGCTCTATCCGTGGCGCTACAGCCCGGCGGTGACCGCCCTGGCGCACT 
GGTCGCAGGACTTCTACCCGGCCCTGGGGCAGCGTTTGCTCGACGAGACCGGGCTCGATCCCGAGGTCCATACCGTTGGC 
CTGTACTGGCTGGACCTGGACGACCAGACCGAGGCACTGCAGTGGGCACGCAACCACACCCGGCCGTTGAAGGAAGTGCC 
GATCGAGGAGGCCTACGCGGCGGTGCCCGGGCTGGGCGCAGGCTTCCAGCGGGCGGTCTACATGTCGGGCGTGGCCAATG 
TGCGCAATCCTCGCCTGGCGCGCTCATTGCGGGCATCCCTGCAACAATTCGCCAATCTCGAGTTGCATGA 

>ORF33205 (SEQ ID NO:372) 

GCGCGCCAGGCGAGGATTGCGCACATTGGCCACGCCCGACATGTAGACCGCCCGCTGGAAGCCTGCGCCCAGCCCGGGCA 
CCGCCGCGTAGGCCTCCTCGATCGGCACTTCCTTCAACGGCCGGGTGTGGTTGCGTGCCCACTGCAGTGCCTCGGTCTGG 
TCGTCCAGGTCCAGCCAGTACAGGCCAACGGTATGGACCTCGGGATCGAGCCCGGTCTCGTCGAGCAAACGCTGCCCCAG 

GGCCGGGTAGAAGTCCTGCGACCAGTGCGCCAGGGCGGTCACCGCCGGGCTGTAGCGCCACGGATAGAGCGGCGAGACGA

TCCCGCCTCCCGCCCAGGATGCCTCACGCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCCGGCGAGCGCCAGC 
TCCCGGGCGGTCAACAGGCCGATGACGCCAGCGCCTACCACTACTACATCTCTACTCACCACAGGGCTCCTACCGATTTG 
CCAGGAACAGAGAAATATCACTCAAAGGGATCAGATGCTGACGAATTGCCTGCTTCAACGAACTCAGTCGAATCTAGTCC 
CGGTGAAAAGCCCATCATACCCGCAGAGGTATTCATCCCATGA 

>ORF33512 (SEQ ID NO:374)

AGCGGCGAGACGATCCCGCCTCCCGCCCAGGATGCCTCACGCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCC 
GGCGAGCGCCAGCTCCCGGGCGGTCAACAGGCCGATGACGCCAGCGCCTACCACTACTACATCTCTACTCACCACAGGGC 
TCCTACCGATTTGCCAGGAACAGAGAAATATCACTCAAAGGGATCAGATGCTGACGAATTGCCTGCTTCAACGAACTCAG 
TCGAATCTAGTCCCGGTGAAAAGCCCATCATACCCGCAGAGGTATTCATCCCATGAAATCGAGTGGTTTGAATTTGGTGG 
AACTATCGATAGTCCTATCGATCCTTGCGATAGGCGTGACAATTGCGCTGCCCACCCTCCCCGACAGAATGAAGCGGGAC 
ATTAG 

>ORF33771 (SEQ ID NO:376)

AAAGCCCATCATACCCGCAGAGGTATTCATCCCATGAAATCGAGTGGTTTGAATTTGGTGGAACTATCGATAGTCCTATC 
GATCCTTGCGATAGGCGTGACAATTGCGCTGCCCACCCTCCCCGACAGAATGAAGCGGGACATTAGCCGTGATATTGGTG 
ACAGCCTGACTAGTCATGTGATGGCTGCGCGGGCTAGCAGCATACAGAACGGCGTGATCATCGAGGTGTGCGGTAGCGGT 
GACGGCAGTACCTGCAGCGAGGAATGGCATCTCGGCTGGTTCAGCCGTAACGACAGGAGCCAACAGATACTGGCCCGGCA 
TGAAAATACGAGTCGCACCGATATTCATTGGCGGGGCTTCGACAAGCGACTGCGCTACCTGCCTAATGGCACCAGCCCTA 
CAGGTAACGGGCGTTTCTTCGAATGTAAGGACGATCGCATCGAGTGGCAATTGGTGCTCAATCGGCAAGGCCGCCTCAGG 
GTGGCGGGAAAGAGCGAAAATAAAAAGCTCTCTTACCTGTGCTCCAGGCGGTGA 

>ORF34385c (SEQ ID NO:378) 

TGGAGAGCGCATTGTCCCTGTAGCAGAGACAGCCGGAGCGGAGAGTGGGATGACTGGCAAACGGTATGTGAAACAGTTCT 
CTCACCGCCTGGAGCACAGGTAAGAGAGCTTTTTATTTTCGCTCTTTCCCGCCACCCTGAGGCGGCCTTGCCGATTGAGC 
ACCAATTGCCACTCGATGCGATCGTCCTTACATTCGAAGAAACGCCCGTTACCTGTAGGGCTGGTGCCATTAGGCAGGTA 
GCGCAGTCGCTTGTCGAAGCCCCGCCAATGAATATCGGTGCGACTCGTATTTTCATGCCGGGCCAGTATCTGTTGGCTCC 
TGTCGTTACGGCTGAACCAGCCGAGATGCCATTCCTCGCTGCAGGTACTGCCGTCACCGCTACCGCACACCTCGATGATC 
ACGCCGTTCTGTATGCTGCTAGCCCGCGCAGCCATCACATGACTAGTCAGGCTGTCACCAATATCACGGCTAATGTCCCG 
CTTCATTCTGTCGGGGAGGGTGGGCAGCGCAATTGTCACGCCTATCGCAAGGATCGATAG 
>ORF33988 (SEQ ID NO:380)

TCATCGAGGTGTGCGGTAGCGGTGACGGCAGTACCTGCAGCGAGGAATGGCATCTCGGCTGGTTCAGCCGTAACGACAGG
AGCCAACAGATACTGGCCCGGCATGAAAATACGAGTCGCACCGATATTCATTGGCGGGGCTTCGACAAGCGACTGCGCTA 
CCTGCCTAATGGCACCAGCCCTACAGGTAACGGGCGTTTCTTCGAATGTAAGGACGATCGCATCGAGTGGCAATTGGTGC 
TCAATCGGCAAGGCCGCCTCAGGGTGGCGGGAAAGAGCGAAAATAAAAAGCTCTCTTACCTGTGCTCCAGGCGGTGAGAG 
AACTGTTTCACATACCGTTTGCCAGTCATCCCACTCTCCGCTCCGGCTGTCTCTGCTACAGGGACAATGCGCTCTCCACT 
AG 

>ORF34274 (SEQ ID NO:382) 

AAAGCTCTCTTACCTGTGCTCCAGGCGGTGAGAGAACTGTTTCACATACCGTTTGCCAGTCATCCCACTCTCCGCTCCGG 
CTGTCTCTGCTACAGGGACAATGCGCTCTCCACTAGGCAAGATTATCTGGGCCTTTTCCTTGTGGAGTACTGCATGCGCT
CTATTTGTCGCAGCGCCGGCTTTTCCCTGATCGAGTTGATGATGGTGTTGGTTCTGGTCGCCATATTCGCCAGCATTGCC 
GTACCCAGTTTCAACGCCTTGATCGAGCGCAACCGAATCCAGACTGCCAGCGAGGAACTCTACAGCCTGCTTCAGTACGC 
TCGCAGCGAAGCTGTAAACCGTCATGCCAATGTGAGCATCAGGGCGACGCAGAACAATGACTGGGCAAAAGGCCTGGAAA
TCATCAGCGGCGCGACCACCGTGCAAAAGCACCAAGGTTTCCAGCAGGTCTCGCTATCCGCCAGCAGTGCGACTGCGGAG 
CTGACCTTCAACGCTACCGGCACACTTAGCAACCAGGCTGCAAACATTGACATAAAGGTCTGCTTCGCCGGTGACAAAAG 
TACAGGACGTCTGCTTACCGTTCAGCCCAGTGGACGCGTGATCCTGTACCCATCTTCAAAGCAACCGGACAGCTGTAACT 
GA 

>ORF34726c (SEQ ID NO:384) 

CGAGACCTGCTGGAAACCTTGGTGCTTTTGCACGGTGGTCGCGCCGCTGATGATTTCCAGGCCTTTTGCCCAGTCATTGT 
TCTGCGTCGCCCTGATGCTCACATTGGCATGACGGTTTACAGCTTCGCTGCGAGCGTACTGAAGCAGGCTGTAGAGTTCC 
TCGCTGGCAGTCTGGATTCGGTTGCGCTCGATCAAGGCGTTGAAACTGGGTACGGCAATGCTGGCGAATATGGCGACCAG 
AACCAACACCATCATCAACTCGATCAGGGAAAAGCCGGCGCTGCGACAAATAGAGCGCATGCAGTACTCCACAAGGAAAA 
GGGCCAGATAATCTTGCCTAGTGGAGAGCGCATTGTCCCTGTAGCAGAGACAGCCGGAGCGGAGAGTGGGATGACTGGCA 
AACGGTATGTGAAACAGTTCTCTCACCGCCTGGAGCACAGGTAA 

>ORF34916 (SEQ ID NO:386) 

GGAAAGCCCATGTCTCGAGAAACGGGTTTCAGCATGATCGAAGTACTGGTTGCTCTGGTGCTGATCAGCATTGGCGTACT 
GGGCATGGTTGCCATGCAAGGGCGCACGATCCAGTACACGCAGGAGTCGGTACAACGCAATGCCGCAGCAATGCTTGCTA 
GCGACCTGATGGAAATAATGCGTGCGGACCCAGATGCCGTACTCAATCTACGCGCCCAACTACGCGAAGACTCGGTCTAC 
TACAAGGCCAAGGGCAGCGACTTTCCCGCAGCCCCAGCGCGCTGCGCGCCATTGCCAGCAGATGCTAAGGAACGTCTCGG 
CTGCTGGGCCCAACAGGCCTCGAAAGACTTGCCGGGAGCCTCCGCACTCTTGAATAGCCAATTCTACATTTGTCGCAGCC 
CAACCCCGGGTACCTGCGACAACACCAAAGGCTCGGCCATCGAAATCCAGGTTGCCTGGCGAGCCATGGATGGAGCGTGT 
TTCAACGCCTCTGACTCCACCTTGTGCACCTACAGCGTCCGCTCCGAATTGTGA 

>ORF35464c (SEQ ID NO:388) 

AGAGCATGCTTGTTCTCACAATTCGGAGCGGACGCTGTAGGTGCACAAGGTGGAGTCAGAGGCGTTGAAACACGCTCCAT 
CCATGGCTCGCCAGGCAACCTGGATTTCGATGGCCGAGCCTTTGGTGTTGTCGCAGGTACCCGGGGTTGGGCTGCGACAA 
ATGTAGAATTGGCTATTCAAGAGTGCGGAGGCTCCCGGCAAGTCTTTCGAGGCCTGTTGGGCCCAGCAGCCGAGACGTTC 
CTTAGCATCTGCTGGCAATGGCGCGCAGCGCGCTGGGGCTGCGGGAAAGTCGCTGCCCTTGGCCTTGTAGTAGACCGAGT 
CTTCGCGTAG 

>ORF35289 (SEQ ID NO:390) 

ATAGCCAATTCTACATTTGTCGCAGCCCAACCCCGGGTACCTGCGACAACACCAAAGGCTCGGCCATCGAAATCCAGGTT 
GCCTGGCGAGCCATGGATGGAGCGTGTTTCAACGCCTCTGACTCCACCTTGTGCACCTACAGCGTCCGCTCCGAATTGTG 
AGAACAAGCATGCTCTTCAGCAAAATGCAGAAAGGCCTATCGATGGTAGAACTGCTCGTGGCACTCGCTATAAGCAGCTT
CCTGATCCTGGGGATCAGCCAGATCTACATCGACAACAAACGCAACTATCTTTTCCAGCAAGGCCAGGCCGGCAACCAGG 
AAAATAG 
>ORF35410 (SEQ ID NO:392)

CTCCACCTTGTGCACCTACAGCGTCCGCTCCGAATTGTGAGAACAAGCATGCTCTTCAGCAAAATGCAGAAAGGCCTATC

GATGGTAGAACTGCTCGTGGCACTCGCTATAAGCAGCTTCCTGATCCTGGGGATCAGCCAGATCTACATCGACAACAAAC

GCAACTATCTTTTCCAGCAAGGCCAGGCCGGCAACCAGGAAAATAGCCGCTTCGTTCTTATGCTGCTGCAGCAACAACTG

GATAAGACAGCCTATCGTCGCCTTCACGACGACAACATGGAGAATGCTTTCAAATCCGCGACATTCAATGGCTGTCGTGC 
ATTTGTGGCTGGCGAGACTATCGCTGCGGCAACTGCCCTCAAGGCGGGTGAGTACGGTGTCTGCTTGCGCTATCAACCCG 
CCTACAAAGGGGAGCATGATTGCCTCGGTAATGAAATTACCGGAGTTCCGGAAAAGCCCTTCACAAATACTCCCCCTGTC 
GTCGTTCGCCTGGTCTACCTACCGAGCGCCGGTACCCTGAGTTGCAGTCGTCCCGATATCGCCCAGTCGAAATCGGGAGA 
ATTGGTCAGTGGTCTCACAGACTTCCGCTTGGAAGCGGGGGTCGGGCCAGCAGATCGTAGCGAACGCAAAGTATCCAGCT 
TCGTCGCACTACAGGATGTCGCCGGTCGTCCTATCCGAGCATTGCGCTTCTCAATCCTGGCAGGCAGCGACAATACAAGC 
CTGCGCACAGGAGATGATAGCCAGGCACGCGATCGCTGGATCGTCCTTTATCCCGAGAGCAAAAGCGCCATCGAGGCCGC 
AGACAAAGGCCAGATTTACCAAATAGCGCGTGGTAACCAAACCATCAGGAATCTCATGCCATGA

>ORF35907c (SEQ ID NO:394) 

GTAGACCAGGCGAACGACGACAGGGGGAGTATTTGTGAAGGGCTTTTCCGGAACTCCGGTAATTTCATTACCGAGGCAAT 
CATGCTCCCCTTTGTAGGCGGGTTGATAGCGCAAGCAGACACCGTACTCACCCGCCTTGAGGGCAGTTGCCGCAGCGATA 
GTCTCGCCAGCCACAAATGCACGACAGCCATTGAATGTCGCGGATTTGAAAGCATTCTCCATGTTGTCGTCGTGAAGGCG 
ACGATAGGCTGTCTTATCCAGTTGTTGCTGCAGCAGCATAAGAACGAAGCGGCTATTTTCCTGGTTGCCGGCCTGGCCTT 
GCTGGAAAAGATAGTTGCGTTTGTTGTCGATGTAGATCTGGCTGATCCCCAGGATCAGGAAGCTGCTTATAGCGAGTGCC 
ACGAGCAGTTCTACCATCGATAG 

>ORF35534 (SEQ ID NO:396)

TCCTGGGGATCAGCCAGATCTACATCGACAACAAACGCAACTATCTTTTCCAGCAAGGCCAGGCCGGCAACCAGGAAAAT 
AGCCGCTTCGTTCTTATGCTGCTGCAGCAACAACTGGATAAGACAGCCTATCGTCGCCTTCACGACGACAACATGGAGAA 
TGCTTTCAAATCCGCGACATTCAATGGCTGTCGTGCATTTGTGGCTGGCGAGACTATCGCTGCGGCAACTGCCCTCAAGG 
CGGGTGAGTACGGTGTCTGCTTGCGCTATCAACCCGCCTACAAAGGGGAGCATGATTGCCTCGGTAATGAAATTACCGGA 
GTTCCGGAAAAGCCCTTCACAAATACTCCCCCTGTCGTCGTTCGCCTGGTCTACCTACCGAGCGCCGGTACCCTGA 

>ORF35930 (SEQ ID NO:398) 

GTTGCAGTCGTCCCGATATCGCCCAGTCGAAATCGGGAGAATTGGTCAGTGGTCTCACAGACTTCCGCTTGGAAGCGGGG 
GTCGGGCCAGCAGATCGTAGCGAACGCAAAGTATCCAGCTTCGTCGCACTACAGGATGTCGCCGGTCGTCCTATCCGAGC 
ATTGCGCTTCTCAATCCTGGCAGGCAGCGACAATACAAGCCTGCGCACAGGAGATGATAGCCAGGCACGCGATCGCTGGA 
TCGTCCTTTATCCCGAGAGCAAAAGCGCCATCGAGGCCGCAGACAAAGGCCAGATTTACCAAATAG 

>ORF36246 (SEQ ID NO:400) 

CCAAACCATCAGGAATCTCATGCCATGACCCTGCGCCATACCTCTCGACAGCAGGGATCCACGTTGTTGATCTCGCTGGT 
TATCTTGTTGATGATCACGCTCCTCGCCGTTTCCAACATGCGCGAGGTGTCACTGGAAAGCCGTATCACCGGCAATCTCA 
TCGAACAGAAGCGCCTGCGCAATGCGGGCGAAGCTGGGCTACGCGAAGGTGAACGACGCTTTTTCAATACCATCAAGCCC 
CCAGAGGTCGGCAGCGGATGCGCCGATAGCAATGTCAAACGGCCTTGCATACTGAACCTGAGTGCCCTCTCCGTACCCCG 
AGATGACGTGCACAACAATCCGGTGGCAGCCCTGAACGGCAAGACAGATAACGCCAATTCACGTGTCTGGATGCCCTACC 
GAGGCAGCGATCTGAATAACCCTACGCAGATCGACAAAGACCGCGCAGTCACCTGGCAGACCATCACGGTGCCCGCTGGC 
GAACAGAACAACGAAGCGGAAAATCCCGAGTACGGCAACATGATGCGCGGGGTCGGCACGTTCTACTACGAAACCAACAG 
CCGCGCCCTCAACAAGGCGGGCGGAGAGACTGTTCTACAGGCCGTTCATGCACGCCTGTATACCAACTGA 

>ORF26640c (SEQ ID NO:402) 

GGCATCCAGACACGTGAATTGGCGTTATCTGTCTTGCCGTTCAGGGCTGCCACCGGATTGTTGTGCACGTCATCTCGGGG 
TACGGAGAGGGCACTCAGGTTCAGTATGCAAGGCCGTTTGACATTGCTATCGGCGCATCCGCTGCCGACCTCTGGGGGCT
TGATGGTATTGAAAAAGCGTCGTTCACCTTCGCGTAGCCCAGCTTCGCCCGCATTGCGCAGGCGCTTCTGTTCGATGAGA 
TTGCCGGTGATACGGCTTTCCAGTGACACCTCGCGCATGTTGGAAACGGCGAGGAGCGTGATCATCAACAAGATAACCAG 
CGAGATCAACAACGTGGATCCCTGCTGTCGAGAGGTATGGCGCAGGGTCATGGCATGA 
>ORF36769 (SEQ ID NO:404)

TGCGCGGGGTCGGCACGTTCTACTACGAAACCAACAGCCGCGCCCTCAACAAGGCGGGCGGAGAGACTGTTCTACAGGCC
GTTCATGCACGCCTGTATACCAACTGACTGGAGCCAGCGCATGATCCACCAGATTACCCGCGCAGGAAAAAGCCTGCTGG 
CTGCAGGTTGCACCCTGAGCATCCTGTTCGCCTCTGACAGTTATGCCGCCACGGCCCTGAATGTCAGCCAGCAACCCCTG 
TTCCTAACCCAGGGCGnGCTCCCAACCTGCTGTTCACTCTAGATGACTCAGGCAGTATGGCCTGGGCTTACGTGCCCGA 
CGGTATTAGCGGGAATAGCGGCAGAGCGGGACGTTCCAGCGATTACAACGCACTGTACTACAACCCCGATTATGCTTACC 
AAGTGCCCAAGAAATTGACACTGTCAGGCGATCAGATCATCGTTTCCGACTATCCAGTGCCACGCTTCACAGCAGCCTGG 
CAGGATGGCTACGCCCAAGGCTCCACCACCAACCTGAGCAATAACTATCGCCCTCAATGGGGAACCGGCTGGCTTGGTTG
CATCGATAGCAGCTGCAATACCGGGAGAGCTTATTACTATACTTATAAGGTAAGCGCTAGCTGCCCTGCACAGCCGGTGA 
GCAGCTCCAACTCCTGTTATACCTACAATGCTCTTCCTACCAGTCAGGAAAGCAACTTTGCGATATGGTACTCCTACTAT 
CGCAACCGCATCCTGGCCACAAAGACCGCTGCCAACCTGGCCTTTTACAGCCTGCCGGAAAACGTGCGTCTCACTTGGGG 
GGCCCTGAACACCTGTAGCATCGGCGCCAACAGCAGAAGCTGCCAAAACAATGCCCTGCTCCAATTCAACAAGCAGCACA 
AAATCAATTTCTTCAATTGGCTGGCGAACAGCCCGGCCAGCGGCGGTACTCCTCTGCATGCGGCTCTTGACCGAGCCGGA 
CGCTTCTTGCAAACCAACGGCACAGCTTATACCACCGAAGACGGAAAGACATATTCCTGCCGGGCCAGCTATCACATCAT 
GATGACCGACGGTATCTGGAACGGTCGGAACGTCACCCCCGGCAATCTCGACAACCAGAACCAGACCTTTCCTGATAGCA 
CCCTCTATAGGCCACAGCCCCCTTATGCCGACAGCAATGCCAGCTCATTGGCTGACCTGGCTTTCAAATACTGGACCACA 
GACTTACGTCCCAGCATCGACAATGACCTGAAGCCTTTCATGGCCTACAAGAGTGGGGACGATTCCAAGGATTACTGGGA 
CCCTCGCAACAACCCAGCCACTTGGCAACACATGGTCAACTTTACCGTTGGCCTAGGTCTTTCCTATTCGCTCACATTGA 
ACTCTGCACCAACTTGGACAGGCAGCACCTTTGGCAACTACGAGGAGTTGATGGCTGGAAGCAAGGCTTGGCCCAGCGTC 
GATAACGACGCCGCACCCGGTAACGTCTACGACCTCTGGCATGCAGCTATCAACTCTCGTGGAGACTTCTTTAGCGCGGA 
ATCACCGGACTCTCTGGTTCAGGCTTTCAATAAGATCCTGACACGGATTTCCGAGCGCAACACCTCCTCCTCCAAACCAG 
CAATGACTTCCGCGCTGCAGGATGACGGAACCGGCGACAAGCTGATCCGCTACAGCTACCAGTCCAGCTTTGCCAGTGAC 
AAGAACTGGGCGGGCGACCTTATACGTTACAAGGTGGAGTCGACTTCCACCGGTTCGACCAAAACCCAGGAATGGAGCGC 
CGGCGCACTGCTGGACAACCGAGCTCCCGCTACCCGTAATATTTACATCGCCAGCAATAGCGGAACCAACCGCCTTAAGC 
CTTTCACATGGAGCAATATTGAGGGAAGT(^GTTAGCCACTTGGCTGAACCGCAACCCGGACAAGGA(MTCAGGCCGAC 
ACCAAAGGAGCACAGCGGGTCGACTTCATCCGTGGCCAGCAGAATATGGATGGATTCCGGCAACGACAGGCGGTGTTAGG 
GGACATCGTGCACTCGTCTCCAGCCGTGGTCGGACCGGCCCAATACCTCACTTATCTGGCCAACCCCATCGAACCCAGCG 
GCGACTACGGCACATTCAAGACAGAGGCAGACCAGCGCAGCCCTAGAGTTTATGTTGGATCCAACGATGGCATGTTGCAT 
GGTTTCAACATCAAAACCGGCGTGGAAGAGTTCGCTTTCATCCCTACAGCAGTATTCGAAAAGCTTAACAAGCTTACCGG
CATCAGCTACCAGGGCGGTGCCCACCAATATTTCGTCGACGCTACACCGGTCGTCAGCGATGCCTTTTTCGATGGAGCTT 
GGCACACTGTTCTGATCGGAACGCTTGGTGCTGGAGGTCGCGGCCTGTTCGCACTCGATGTAACCAAGCCGGACGATGTC 
AAGCTGCTTTGGGAATACGATAGCAGTACCGACTCGGACCTTGGTTACACCTTCTCCAAACCTACCGTAGCCAGACTGCA
CAGCGGACAATGGGCAGTAGTTACCGGCAACGGCTATGGAAGCGATAATGACAAGGCAGCTTTACTGCTGATTGATTTGA 
AAAAGGGAACGCTGATCAAGAAGCTGGAAGTCCAAAGCGAGCGCGGAATAGCCAATGGCCTATCGACGCCTCGCCTGGCT 
GATAACAACAGCGATGGCATTGCTGACTACGCCTATGCTGGCGATCTGCAGGGAAATATCTGGCGCTTCGATTTGATCGG 
CAATACCCGCAACGACGACCCAGACACAAATACCTCTATCAATCCCTTCAAGCCCGGAGATGTAGATCCTTCTGCTTTCA 
GAGTATCGTTCAGCGGCGCCCCGCTTTTCCGTGCTCGCGCCGACAACAATACTCGTCAGCCCATCACGGCTCCGCCTACC 
TTGGTACGCCATCCTAGCCGTAAGGGCTACATCGTCATCGTAGGTACAGGAAAATACTTCGAGGACGATGACGCTCAGGC 
CGATACCAGCCGAGCCATGACGCTCTATGGTATCTGGGATCGCCAGACCAAGGGCGAAAGCGCAAACAGTACCCCAACCA 
TCGACCGCAACGCCCTCACAGCCCAAACCATGACAACAGAGGCGAACTCCACATTCGGTAGCGTGAACAGGAATATTCGG 
CTTATTAGCCAAAACCCGGTGAAGTGGTACAAAGACGGAGCAACCGGTACCGCGAACTCGGATGTGGCTAGCTATGGCTG 
GCGACTGAATCTGGAGGTCAATAGCAGCAAGAAAGGCGAAATGATGATCGAAGATATGTTCGCTGCCGGCCAAGTGCTTC 
TATTGCAGACCTTGACACCGAACGACGACCCTTGTGACAGCGGCTCTACCAGCTGGACCTACGGCCTCAATCCATATACT 
GGCGGACGTACCAGTTTCACCGTCTTCGATCTCAAACGTGCGGGTATAGTGGACTCTGGCTCGGATTACAACGGCTCGGT 
CGTATCCGCCTTCCAACAGGATGGACTAGGTGGCTTGGCCATTACCCAGAACGAACAGCGTCAATCCGAGGCTTGCACTG 
GTGATGAGTGCATCATCTTCAACCCCAGCGACAAGAGTAACGGACGACAAACCTGGCGGGTCGTCGAGGAGAAATGA 
>ORF37932c (SEQ ID NO:406)

GCTGGCATTGCTGTCGGCATAAGGGGGCTGTGGCCTATAGAGGGTGCTATCAGGAAAGGTCTGGTTCTGGTTGTCGAGAT

TGCCGGGGGTGACGTTCCGACCGTTCCAGATACCGTCGGTCATCATGATGTGATAGCTGGCCCGGCAGGAATATGTCTTT 

CCGTCTTCGGTGGTATAAGCTGTGCCGTTGGTTTGCAAGAAGCGTCCGGCTCGGTCAAGAGCCGCATGCAGAGGAGTACC 

GCCGCTGGCCGGGCTGTTCGCCAGCCAATTGAAGAAATTGATTTTGTGCTGCTTGTTGAATTGGAGCAGGGCATTGTTTT 

GGCAGCTTCTGCTGTTGGCGCCGATGCTACAGGTGTTCAGGGCCCCCCAAGTGAGACGCACGTTTTCCGGCAGGCTGTAA 

AAGGCCAGGTTGGCAGCGGTCTTTGTGGCCAGGATGCGGTTGCGATAGTAGGAGTACCATATCGCAAAGTTGCTTTCCTG 

ACTGGTAGGAAGAGCATTGTAGGTATAACAGGAGTTGGAGCTGCTCACCGGCTGTGCAGGGCAGCTAGCGCTTACCTTAT 

AAGTATAGTAATAAGCTCTCCCGGTATTGCAGCTGCTATCGATGCAACCAAGCCAGCCGGTTCCCCATTGAGGGCGATAG 

TTATTGCTCAGGTTGGTGGTGGAGCCTTGGGCGTAGCCATCCTGCCAGGCTGCTGTGAAGCGTGGCACTGGATAGTCGGA 

AACGATGATCTGATCGCCTGA 
>ORF38640c (SEQ ID NO:408) 

CTGACTTCCCTCAATATTGCTCCATGTGAAAGGCTTAAGGCGGTTGGTTCCGCTATTGCTGGCGATGTAAATATTACGGG 
TAGCGGGAGCTCGGTTGTCCAGCAGTGCGCCGGCGCTCCATTCCTGGGTTTTGGTCGAACCGGTGGAAGTCGACTCCACC 
TTGTAACGTATAAGGTCGCCCGCCCAGTTCTTGTCACTGGCAAAGCTGGACTGGTAGCTGTAGCGGATCAGCTTGTCGCC 
GGTTCCGTCATCCTGCAGCGCGGAAGTCATTGCTGGTTTGGAGGAGGAGGTGTTGCGCTCGGAAATCCGTGTCAGGATCT 
TATTGAAAGCCTGAACCAGAGAGTCCGGTGA 

>ORF39309c (SEQ ID NO:410) 

AGCTGCCTTGTCATTATCGCTTCCATAGCCGTTGCCGGTAACTACTGCCCATTGTCCGCTGTGCAGTCTGGCTACGGTAG 
GTTTGGAGAAGGTGTAACCAAGGTCCGAGTCGGTACTGCTATCGTATTCCCAAAGCAGCTTGACATCGTCCGGCTTGGTT 
ACATCGAGTGCGAACAGGCCGCGACCTCCAGCACCAAGCGTTCCGATCAGAACAGTGTGCCAAGCTCCATCGAAAAAGGC 
ATCGCTGACGACCGGTGTAGCGTCGACGAAATATTGGTGGGCACCGCCCTGGTAGCTGATGCCGGTAAGCTTGTTAAGCT 
TTTCGAATACTGCTGTAGGGATGAAAGCGAACTCTTCCACGCCGGTTTTGATGTTGAAACCATGCAACATGCCATCGTTG 
GATCCAACATAAACTCTAGGGCTGCGCTGGTCTGCCTCTGTCTTGAATGTGCCGTAGTCGCCGCTGGGTTCGATGGGGTT 
GGCCAGATAAGTGAGGTATTGGGCCGGTCCGACCACGGCTGGAGACGAGTGCACGATGTCCCCTAA 

>ORF38768 (SEQ ID NO:412) 

GGGACATCGTGCACTCGTCTCCAGCCGTGGTCGGACCGGCCCAATACCTCACTTATCTGGCCAACCCCATCGAACCCAGC 
GGCGACTACGGCACATTCAAGACAGAGGCAGACCAGCGCAGCCCTAGAGTTTATGTTGGATCCAACGATGGCATGTTGCA 
TGGTTTCAACATCAAAACCGGCGTGGAAGAGTTCGCTTTCATCCCTACAGCAGTATTCGAAAAGCTTAACAAGCTTACCG 
GCATCAGCTACCAGGGCGGTGCCCACCAATATTTCGTCGACGCTACACCGGTCGTCAGCGATGCCTTTTTCGATGGAGCT 

TGGCACACTGTTCTGA 
>ORF40047c (SEQ ID NO:414) 

AAGCACTTGGCCGGCAGCGAACATATCTTCGATCATCATTTCGCCTTTCTTGCTGCTATTGACCTCCAGATTCAGTCGCC 
AGCCATAGCTAGCCACATCCGAGTTCGCGGTACCGGTTGCTCCGTCTTTGTACCACTTCACCGGGTTTTGGCTAATAAGC 
CGAATATTCCTGTTCACGCTACCGAATGTGGAGTTCGCCTCTGTTGTCATGGTTTGGGCTGTGAGGGCGTTGCGGTCGAT 
GGTTGGGGTACTGTTTGCGCTTTCGCCCTTGGTCTGGCGATCCCAGATACCATAGAGCGTCATGGCTCGGCTGGTATCGG 
CCTGAGCGTCATCGTCCTCGAAGTATTTTCCTGTACCTACGATGACGATGTAGCCCTTACGGCTAGGATGGCGTACCAAG 
GTAGGCGGAGCCGTGATGGGCTGACGAGTATTGTTGTCGGCGCGAGCACGGAAAAGCGGGGCGCCGCTGAACGATACTCT 
GAAAGCAGAAGGATCTACATCTCCGGGCTTGAAGGGATTGATAGAGGTATTTGTGTCTGGGTCGTCGTTGCGGGTATTGC 
CGATCAAATCGAAGCGCCAGATATTTCCCTGCAGATCGCCAGCATAGGCGTAGTCAGCAATGCCATCGCTGTTGTTATCA 
GCCAGGCGAGGCGTCGATAG 

>ORF40560c (SEQ ID NO:416) 

CCGGCGAGTCCTGTTGTTGGACACGGTTGGGCAAGCGATATGTCTGCCCATCGACTACTACCAGACCGGCGGCAGGATGA 
ACATCCTCGACCACGCCCACATTCTCGAACGTATTCGTGGCACTCAAGGCAAAGGTTGGGCAAGCCAGAGCTAGAGCTGC 
AAGAGCTGTGGCGAGAAGACGTAAGGGGTTCATGTTCATTTCTCCTCGACGACCCGCCAGGTTTGTCGTCCGTTACTCTT 
GTCGCTGGGGTTGAAGATGATGCACTCATCACCAGTGCAAGCCTCGGATTGACGCTGTTCGTTCTGGGTAATGGCCAAGC 

CACCTAG 
GTGOTTGG^

GACAAGAGTAACGGACGACAAACCTGGCGGGTCGTCGAGGAGAAATGAACATGAACCCCTTACGTCTTCTCGCCACAGCT
CTTGCAGCTCTAGCTCTGGCTTGCCCAACCTTTGCCTTGAGTGCCACGAATACGTTCGAGAATGTGGGCGTGGTCGAGGA 
TGTTCATCCTGCCGCCGGTCTGGTAGTAGTCGATGGGCAGACATATCGCTTGCCCAACCGTGTCCAACAACAGGACTCGC 
CGGTCATATTCTTGGTACGTCAGGGACAGACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCG
TTCTACATTATCAAGCAGGCCCCTCTCGTTCCCTTCGGATCGGAGCAGCAACAATGAAGTCGAACAGAGGCTTCACTCTC 
ATCGAGTTGATGATCGTCGTAGTAATCATCGCTATTCTTGCTGGTATCGCCTACCCCAGCTACGACGAATACGTGAAGCG 
CGGGAATCGCACCGAAGGACAGGCATTACTCAGCGAAGCAGCCGCTACTCAAGAGCGCTATTTTTCACAGAACAATACTT 
ATATCACTACCCAAGCCGACATCGGCAAGCTGCATATGCGCAACACATCGGGCACCACAGTGAAGTCCTCCACAGGCAAA
TACAGCCTTACCGTCGATACGGTAGCCAACGACGGAGGTTATCGCCTTATCGCTAACCAGGCATTCAACGATCTTGATTG
TGGCAACCTGACCTTGACCGCCAACGGCGAGAAAGGCCGGACTGGAAGCAAGAAGAGCGTTGCAGAATGCTGGCGCTAA 

>ORF40329 (SEQ ID NO:420)

CGGACGACAAACCTGGCGGGTCGTCGAGGAGAAATGAACATGAACCCCTTACGTCTTCTCGCCACAGCTCTTGCAGCTCT 
AGCTCTGGCTTGCCCAACCTTTGCCTTGAGTGCCACGAATACGTTCGAGAATGTGGGCGTGGTCGAGGATGTTCATCCTG 
CCGCCGGTCTGGTAGTAGTCGATGGGCAGACATATCGCTTGCCCAACCGTGTCCAACAACAGGACTCGCCGGTCATATTC 
TTGGTACGTCAGGGACAGACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCGTTCTACATTAT 

caagcaggcccctctcgttcccttcggatcggagcagcaacaatga 
aStctgtt^ 

tttctggcaggtcgctggtgagtttgccggagaaagacactgtctgtccctgacgtaccaagaatatgaccggcgagtcc 
tgttgttggacacggttgggcaagcgatatgtctgcccatcgactactaccagaccggcggcaggatgaacatcctcgac 
cacgcccacattctcgaacgtattcgtggcactcaaggcaaaggttgggcaagccagagctag 

tSmgggSgam 

acagtgtctttctccggcaaactcaccagcgacctgccagaaatcgagtcgttctacattatcaagcaggcccctctcgt 
tcccttcggatcggagcagcaacaatgaagtcgaacagaggcttcactctcatcgagttgatgatcgtcgtagtaatcat 
cgctattcttgctggtatcgcctaccccagctacgacgaatacgtgaagcgcgggaatcgcaccgaaggacaggcattac 
tcagcgaagcagccgctactcaagagcgctatttttcacagaacaatacttatatcactacccaagccgacatcggcaag 
ctgcatatgcgcaacacatcgggcaccacagtgaagtcctccacaggcaaatacagccttaccgtcgatacggtagccaa 

CGACGGAGGTTATCGCCTTATCGCTAA 

gtgKot 

aggctttgtaatcggaatttttgcgcacctgaaaaagcccggcttatgccgggctttgcctttttcttgtctcggcgctt 
tagcgccagcattctgcaacgctcttcttgcttccagtccggcctttctcgccgttggcggtcaaggtcaggttgccaca 
atcaagatcgttgaatgcctggttagcgataaggcgataacctccgtcgttggctaccgtatcgacggtaaggctgtatt 
tgcctgtggaggacttcactgtggtgcccgatgtgttgcgcatatgcagcttgccgatgtcggcttgggtagtgatataa 

GTATTGTTCTGTGA 
>ORF42234c (SEQ ID NO:428)

TCGACGTCCAGCCGGCCTGAACCGTCGGTCGCTGCGCCCTTCCCAAGCGGGGAGGGCGGTAGCAAGGTTCATTCGTCCAA 
TCACCGCGTCGCCCACGAGACCGCCATGCAAATCAAACTCGCCAATCCCCGCGGCTTCTGCGCCGGCGTGGATCGCGCCA
TCGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTGGTGCACAACAAGTTCGTC 
GTGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAACGTCATCGTCATCTTCAG 
CGCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGAAGGTTTTCGACGCGACCTGCCCGCTGG 
TGACCAAGGTGCACATGGAAGTGGTGCGCTACAGCCGCGACGGCCACGAATGCGTGCTGATCGGGCATGAAGGCCACCCC 
GAGGTGGAAGGCACCATGGGCCAGTACGATGCCAGCAACGGCGGTGCCATCTACCTGGTGGAGGACGAGGCCGACGTCGC 
CGCGCTGGAGGTGCGCAAGCCCGAAGCCCTGCACTACGTGACCCAGACCACCCTGTCGATGGACGACACCTCGAAGGTCA 
TCGATGCCCTGCGCGCCAAGTTCCCGCAGATCCAGGGGCCGCGCAAGAACGACATCTGCTATGCCACCCAGAACCGCCAG 
GATGCCGTGAAGGAACTGGCCGACCAGTGCGACATGGTCCTGGTGGTGGGCAGCCCCAACAGTTCCAACTCCAACCGCCT 
GCGCGAACTCGCCGAGCGCATGGGCACGCCGGCCTACCTGATCGACGGCGCCGAGGACATGCAACGCGGCTGGTTCGACG 
GTGTGCGTCGCATCGGAATCACCGCAGGCGCCTCCGCGCCGGAAGTGCTGGTGCGCGGAGTGATCGCCCAGCTACGTGAG 
TGGGGGGCGTCGGAAGAGCAGGAACTGGAGGGACGGGAGGAGAACATTACCTTCTCGATGCCCAAGGAACTGCGGGTCAA 

GGCTTTGTAA

>ORF41764c (SEQ ID NO:430)

AGGCCACCCCGAGGTGGAAGGCACCATGGGCCAGTACGATGCCAGCAACGGCGGTGCCATCTACCTGGTGGAGGACGAGG
CCGACGTCGCCGCGCTGGAGGTGCGCAAGCCCGAAGCCCTGCACTACGTGACCCAGACCACCCTGTCGATGGACGACACC 
TCGAAGGTCATCGATGCCCTGCGCGCCAAGTTCCCGCAGATCCAGGGGCCGCGCAAGAACGACATCTGCTATGCCACCCA 
GAACCGCCAGGATGCCGTGAAGGAACTGGCCGACCAGTGCGACATGGTCCTGGTGGTGGGCAGCCCCAACAGTTCCAACT 
CCAACCGCCTGCGCGAACTCGCCGAGCGCATGGGCACGCCGGCCTACCTGATCGACGGCGCCGAGGACATGCAACGCGGC 
TGGTTCGACGGTGTGCGTCGCATCGGAATCACCGCAGGCGCCTCCGCGCCGGAAGTGCTGGTGCGCGGAGTGATCGCCCA 
GCTACGTGA 

>ORF41284 (SEQ ID NO:432) 

CTGGGCGATCACTCCGCGCACCAGCACTTCCGGCGCGGAGGCGCCTGCGGTGATTCCGATGCGACGCACACCGTCGAACC 
AGCCGCGTTGCATGTCCTCGGCGCCGTCGATCAGGTAGGCCGGCGTGCCCATGCGCTCGGCGAGTTCGCGCAGGCGGTTG 
GAGTTGGAACTGTTGGGGCTGCCCACCACCAGGACCATGTCGCACTGGTCGGCCAGTTCCTTCACGGCATCCTGGCGGTT 
CTGGGTGGCATAGCAGATGTCGTTCTTGCGCGGCCCCTGGATCTGCGGGAACTTGGCGCGCAGGGCATCGATGACCTTCG 
AGGTGTCGTCCATCGACAGGGTGGTCTGGGTCACGTAGTGCAGGGCTTCGGGCTTGCGCACCTCCAGCGCGGCGACGTCG 
GCCTCGTCCTCCACCAGGTAGATGGCACCGCCGTTGCTGGCATCGTACTGGCCCATGGTGCCTTCCACCTCGGGGTGGCC
TTCATGCCCGATCAGCACGCATTCGTGGCCGTCGCGGCTGTAGCGCACCACTTCCATGTGCACCTTGGTCACCAGCGGGC 
AGGTCGCGTCGAAAACCTTCAGGCCGCGCCCCTCGGCTTCCTTGCGGACCGCCTGGGAAACGCCGTGGGCGCTGAAGATG 
ACGATGACGTTGTCCGGCACCTGATCGAGTTCCTCGACGAAGATGGCGCCGCGCTGGCGCAGGTTGTCCACGACGAACTT 
GTTGTGCACCACCTCGTGACGCACGTAGATCGGCGGGCCGAAGACATCGAGGGCACGGTTGACGATCTCGATGGCGCGAT 
CCACGCCGGCGCAGAAGCCGCGGGGATTGGCGAGTTTGATTTGCATGGCGGTCTCGTGGGCGACGCGGTGATTGGACGAA
TGAACCTTGCTACCGCCCTCCCCGCTTGGGAAGGGCGCAGCGACCGACGGTTCAGGCCGGCTGGACGTCGA 

>ORF41598 (SEQ ID NO:434) 

CCTTCGAGGTGTCGTCCATCGACAGGGTGGTCTGGGTCACGTAGTGCAGGGCTTCGGGCTTGCGCACCTCCAGCGCGGCG 
ACGTCGGCCTCGTCCTCCACCAGGTAGATGGCACCGCCGTTGCTGGCATCGTACTGGCCCATGGTGCCTTCCACCTCGGG 
GTGGCCTTCATGCCCGATCAGCACGCATTCGTGGCCGTCGCGGCTGTAGCGCACCACTTCCATGTGCACCTTGGTCACCA 
GCGGGCAGGTCGCGTCGAAAACCTTCAGGCCGCGCCCCTCGGCTTCCTTGCGGACCGCCTGGGAAACGCCGTGGGCGCTG 
A 

>ORF42172c (SEQ ID NO:436) 

CAAGGTTCATTCGTCCAATCACCGCGTCGCCCACGAGACCGCCATGCAAATCAAACTCGCCAATCCCCGCGGCTTCTGCG 
CCGGCGTGGATCGCGCCATCGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTG 
GTGCACAACAAGTTCGTCGTGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAA 
CGTCATCGTCATCTTCAGCGCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGAAGGTTTTCG
ACGCGACCTGCCCGCTGGTGACCAAGGTGCACATGGAAGTGGTGCGCTACAGCCGCGACGGCCACGAATGCGTGCTGATC 
GGGCATGA 

>ORF42233c (SEQ ID NO:151)

CGACGTCCAGCCGGCCTGAACCGTCGGTCGCTGCGCCCTTCCCAAGCGGGGAGGGCGGTAGCAAGGTTCATTCGTCCAAT
CACCGCGTCGCCCACGAGACCGCCATGCAAATCAAACTCGCCAATCCCCGCGGCTTCTGCGCCGGCGTGGATCGCGCCAT
CGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTGGTGCACAACAAGTTCGTCG
TGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAACGTCATCGTCATCTTCAGC
GCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGA

>ORF2 (SEQ ID NO:3)

SPIQCQGVPGQSEPTHGCRGRHCQAPGRRREQHQYRLQRQRHQLRDDRNQQQLGPQQHPLRRRQRHPAVDEQWRGGLRR
RLRAARCAGRSASRSATGDRL* 

>ORF3 (SEQ ID NO:5)

RRSNAKEYLGNQSLLTAAGAGIAKLLDADENNTSTVFSGNGTSFGTTGTNSNSALNSILSGGVSDIRQWMNKLYGEAFAA
VYVQPGARVAVHLDQQLAIDYELKGRKVDYSSGAAHATADLD* 

>ORF602c (SEQ ID NO:7)

SAWSFAEASCCGSIGRRSVCLASRSSRPRLLPIELVAPRSQTSSMLASPWGSISSLLVEHAARVSAQARPAQRRRRGLVQ 
VCCCMSGSRAVIDLAALEFIVDRQLLIEMHCDPRTWLHVDGGEGLPVQLVHPLPDVADAAGEDAVEGRVAVGSGRPEAGA 
VAAEDGTGWLVGVQELGNAGPGSRE* 

>ORF214 (SEQ ID NO:9)

TSCTGRPSPPSTCSQVRGSQCISISNWRSTMNSRAARSITALEPLMQQQTWTNPLLRLCAGLACALTLAACSTSKEEMLP 
HGEANMLDVWERGATSSIGNSRGRLLLDARQTLRRPIDPQQDASANDQADYTRTASNEIHSQFKRLPNPDLVMYVFPHLA 
GSDPAPVPGYTTVFPFYQRVQYAMPGERTEDY* 

>ORF1242c (SEQ ID NO:11)

SRPGRRTGQSRVRFRARRRSSAGLLSMRPGRSASNWDRGPRCPRAPVRRMRRANAHPPGASLARRAGTQPRAAGLRTMGR 
DRRGVTLRPAWRHSCSRCWAEEYPWRPVAPDSAQSLLPRPLRPALLNLRERLPVPTEAVCDRAEGFEKSPSIVLRAFARH 
GVLDSLVEGEHGGVARYRGGIAAGQVREHIHHQVGIGQSFELTVDLVAGRAGWGLVIRGGILLRIDWAPQRLPGIEEQP 
ATAVAYRAGRTSLPDVEHVGLAVGQHLLLAGGARCQGQRAGQAGAETKKGVSPSLLLHERLQSCNRPCGP* 

>ORF594 (SEQ ID NO:13)

PGRLHPHGQQRDPQSVQTTAQSRPGDVCVPAPGRQRSRPGTGLHHRVPLLPASPVRHAGRTHGGLLMGFFQTLLRGRTQP 
QSVPADAPEDSGALDVAAAEEATERYLARLAAMGIPLPNTGSKNGATQAEASRLYDHDPSFVDLLPWAEYLPDEQVMLLE 
DGRSRAAFFELVPLGTEGRDPNWMQNARDALKEALQNSFDEHETSPWIVQFYAQDEISWDNFQEQLRQYVHPRARGSAFS 
EMYLAIMHHLEGISKPGGLFVDTAVSKLPWRGQQRRVRMVVYRRIRKEDAQIRGQDPAAYLKSICERIQGGLANAGIVA 
SRMGGQEIRHWLIRWFNPHPDHLGQAEADLRRFYELVCRPDEPILQDELPLADGTDFSQNLFYRQPVSDATQGVWLFDAM 
PHRVIWDQLNKAPLTGHFTGETLKGDGLNALFDRMPEDTLLCITMWTPQDMLEGHLQQLSKKAVGDTQASIHTREDVA 
TVRRLIGREHKLYRGAIALFVRGRDHTQLEERCITLSHVLLGAGLVPVEPQNEVGPLNSYLRWLPSNFDPNEKRALEWYT 
QMMFAQHIANLSPIWGRTTGTGHPGFTLFNRGGAPLTFDPFNKLDRQMNAHGFIFGPTGSGKSASLTNLICQMLAMYLPR 
MFVAEAGNSFGLLADLAKRFGLSVHRVRLAPGSGVSLAPFADAIKLVESPDQVKVLDAEDIEASDSVQGSKADLEDDQRD 
ILGEMEIVARLMITGGEEKEDARLTRADRSAVRQAILAAARTCAAANRTVLTQDVRDALYEASRSDSTAPERRARIAEMA 
EAMQMF04GADGEMFMEGTPWPEADLTVVDFATYAREGYAAQLGIAYISLLNTVNNIAERDQFKGRPIVKITDEGHIIT 
KHPLLLPYAMITKMWRKLGAWmATQNIDDIPASGAPMLMIEmCLNMPPDEVEKISRFRELSPAQKSMMLSARKE 
SGKFTEGVLLAKGKEYLVRWPPSLYLALAMTENEEKNQRYNIMQATGCDELEAALQVAADLDKARGLPPFPIVFPDQPA 
VECQDE* 

>ORF1040 (SEQ ID NO:15)

VPARRASDAPGGWAFARRILRTGALGHRGPRSQLDAERPGRIERSPAELLRRARNLTLDCPVLRPGRDQLGQFPGAVEAV 
RPSSSARIGLQRDVPGAHEASPGGHFEAGRTVRRHRRQQAALARTTAPRADGRLPPDPQGGCADSRTGPGGVPEIHLRAY 
PRRPGERRHRRFAHGRTGDQELVDPLVQPAPGSPRPGRGGPTSLLRTGMPSGRTDPAG* 

>ORF1640c (SEQ ID NO:17)

VRLGLAEVIRVRVEPADQPVPDLLSAHARSDDAGVRQAALDTLADGFQVRRRVLSANLRILLADPAVDDHPHAALLSSPG 
QLADGGVDEQSARLRNALQVMLHERQVHLAEGRSSRSRMDVLPQLLLEIVPADLVLGVELDNPG* 

>ORF2228c (SEQ ID NO:19)

GEPAQVAVQRSDFVLRFDRHQAGAEQYVAQGDAAFLQLGMVAAABEQSDRSAIELVLPADQASNGGHVLAGVDRGLGVTN
GLFRELLQMPFQHVLRRHDHGDAQQRVLGHSIEQGVEAIAFERLAGEVACQRRFVQLVDHNHSVRHGIEEPYALGGIGNR 
LPIEQVLGEVSAVGQWQFILQDRFVRTAYQFVEAT* 

>ORF2068c (SEQ ID NO:21)

SLCSRPIRRRTVATSSRVWIEAWVSPTAFFESCCRCPSSMSCGVTTMVMHSSVSSGIRSNRALRPSPLSVSPVKWPVSGA 
LFNWSTTITRCGMASKSHTPWVASETGCR* 

>ORF1997 (SEQ ID NO:23) 

HPGLDPHPRGRGHRSTPDRPGAQALSRSDRSVRARPRPYPVGGTLHHPEQRTARRRPGAGRTAERSRTAEQLPALAPLKL 
RSKREASPGVVHPDDVRSMRQPVAHLGAHHRYRTPV^HAVQPWRRAVDLRPVQQAGPADECPRLHLRANWLRQVGVPDQ 
PHLPDARHVPAADVRRGSGQQLRPAGRLSQAVWPLGPPGAPRPGLRRQPGAVRGRHQAGREPRPSEGAGRRRHRGLGLGP 
GQQGRPRGRPARHPGRDGDRRPPHDYRWRREGRCAPDPCRSQRRPPGDPGGGQDLRRREPHGTDPRRARCALRGLQER* 

>ORF2558c (SEQ ID NO:25) 

VGQQAEAVARFRDEHPRQVHGEHLADEVGQGRRLAGASWPEDEAVGIHLPVQLVERVEGQRRAATVEQREARVSGTGGAP 
PDGRQVGDVLSEHHLGVPLQGSLLVWIEV* 

>ORF2929c (SEQ ID NO:27) 

SASRTSWVSTVRFAAAQVLAAARIAWRTALRSARVRRASSFSSPPVIMRRATISISPRMSRWSSSRSALLPWTESEASHS 
SASSTFTWSGLSTSLHASANGARLTPEPGARRTRWTERPNRLAKSASRPKLLPASATNIRGRYMASIWQMRLVRDADLPE 
PVGPKMKPWAFICRSSLLNGSKVNGAPPRLNSVKPGCPVPWRPQMGDRLAMC* 

>ORF3965c (SEQ ID NO:29) 

APVGPYQAVDWAAIHPRAALSAGRYPGDRLPSVESAAAPLSVQERISLASAGHPLRGSAGSGSGCRSGSGSANSELSFV 
LALHCRLVWEMGEGWQAARLVEIRCDLQGRLELVMGGLHDWALVLFFVFGHGQGQVETRGNHTDEVFFALGQEHALG 
ELAAFLAGREHHRLLRRRQLAEPGYLLYFVGGHVQAQPPLDHVQHRRPGGWDVVDVLGGEPEPGAQFPPHLGDLHGVGQQ 
QRVLGDDVPLIGDLDDWPALELVAFGDWHGVQQRDVGDPELGGVAFARVRCEIHHGKVGLRPGRAFAIEHLAVGAHAEH 
LHGFRHFGDPRAAFWRGAIAPGGLVERIAHVLGQYRAVRGGAGPGRRQDRLADGAAIGTGQARIFLLFATGNHEAGDDLR 
LAQDVSLVVLEVGLAALDRVRGLDVFGVQHLHLVGALDQLDGVRERRQADAGARGEAHPVDREMPLG* 

>ORF3218 (SEQ ID NO:31)

GAHHHQAPAAMLRHEDHQDVAETGRLVLARHPEHRRHPSLRGADAEHDRWWPEHAPRRSREDIQVPRAVAGABVDDA 
LGPQGKRQVHRGRAPGQGQRIPRPCGSPESLPGPGHDRKRRKEPALQHHASHRLRRARGGLAGRSGSRQGARPATLPHCF 
PRPTGSGVPGRMRVLNSLTQNLIDNLTQILQNPEEDALQTLRICAPVLIEELQQIQLRAVDRRDIVPQIKQLLDEWLQQH 
PQPDTAQQALIEAVDRAEILQRRQA* 

>ORF3568 (SEQ ID NO:33)

PKTKKRTSATTSCKPPAATSSRRPCRSQRISTRRAACHPSPLFSQTNRQWSARTNESSEFADPEPDRQPDPDPAEPRRGC 
PADAKDMRSCTDRGAAADSTEGSRSPGYRPADKAALG* 

>ORF4506c (SEQ ID NO:35) 

VNKFWFRTFLQSSLVQFRKVQCAARQPAPVAGRLSEDRIDSAPEGFGAALDPRALHQASLVAGRLAMHLQGKMAPNQVH 
VRMAVYPALKPRGVDLAEGALQVGVFIDRPARFRIAVEAWGWQALHQKLYPYGGCSQQDQQQPRPGQGGTLKSFGCPAA 
LQESHACLRCRISARSTASMSACWAVSGCGCCCSHSSKSCFICGTISRRSTALS* 

>ORF3973 (SEQ ID NO:37) 

GRGPRGDPTAEASVRLLKGGWAAKRFQGPALPWAGLLLVLLAASAVGVELLVKGLPANHSLYGDARARWTINEYADLECP 
FCXVYTPRLKRWVDSHPDVNLVWRHLPLQMGEAARHQARLVECAGIQGGAKAFWSAIDAIFAQSAGNGGGLPGGTLDFP 
ELDQARLEKCAKDNELIDSDIKLDIDIARSKGITATPTLVIRDNQTGRSVKLEGMADETTLLSAIDWLAKDL* 

>ORF4271 (SEQ ID NO:39)

TWFGAIFPCRCMARRPATRLAWSARGSKAAPKPSGALSMRSSLSRPATGAGCLAAIM 
ISSWTSTLHGRRALQRPRPSSSGTTRRDEA* 

>ORF4698 (SEQ ID NO:41)

EIGEDSNIPLLVLQDALHFTWQNLDLLPIHNLYHSLVAGAGEAKPQLHCRPSIDVNALEQALHDFDHSLISVSQLHTGIM 
LPRTCRRHPYLCTWQRSITARKNTPPTS* 

>ORF5028 (SEQ ID NO:43)

FPAALSEVILSAVCTFLEPVQTHASSSLPPWPAATNAGRWRTTGTAEQRESGRNLGHHRQGSSGLCHRIVARSVSGRPGT 
PRGATDCGLAPGSTACSSGV* 

>ORF5080 (SEQ ID NO:45) 

NRYRPMPLHHSPPGRRPPTLAVGVLLVLLSSASQAETWVITDKAHPVSATGSSRVLFLDAQEHLEEQLTAALPQDPQHAQ 
AAFKRLLQSPDGRRLQAELVKAQQDVADAWSLGVEKIPAVWDRQYWYGEPDVSRALELIAKARRSR* 

>ORF6479c (SEQ ID NO:47) 

FVSVSLLEVGTADEHLPLALAAGVGTPERPGVLPVDGLRLRPRVGKHRAVEAQGWGQLLPFPGRGIALFQLARRPVAVLG 
GCAHGEVDVELADSRGDIAGALGDDGCRLVVVGLVQEAAARIEVPPHVAGEDSTHLAQPWDQRFGVHLLGNSMPPANGVQ 
CAEKVRHQRDGGARANVPRGAGEPAERGATRMADHIRFLEAADAVLGLVVCGRVIAGLGEWIRCTQRRYLGPGVAPGIRV 
AGDDCVRHVVADLDRRLHFAAMRAAEQPVTDPDDLVFEALRGKGGGDDGSAVDRGRGREREAEGGGRRCQAAEVEAGHQR 
DLLALAISSRARETSGSP* 

>ORF5496 (SEQ ID NO:49) 

ANRQGQEVALMTSLNLRRLAAAAATFSLSFTASMINSAAIVSSTLSPQC^^ 

VPDAWSSYANTGSNPWTEVSALGTPNPLAQAGNDATTNYKAENSIGRFKEADVIGHPGGATFSRFASASGYVCPGATVP 
LVPYFLSTLDAIGWRHGIPEQVYPEALVPGLREVGGIFSGDMWGNLYPRSGFLHQTDDYKTAAVIAQRAGDITTRIGQLH 
VYLPMRAAPKDGYWPAGELKEGDASTGKWQELTPSLSLNCAVFPNSGPKTQAVDGEHAWALWRPYSCCQRKGQMFICSTD 
FQ* 

>ORF5840 (SEQ ID NO:51)

RDHKLQGREQHRPLQGSGCDRPSWWRHVQPVRQRLWVRLPWRHRPAGAVLSQHTGRHWLAAWNSRAGVPRSVGPRAARGG 
WNLLRRHVGEPLSAQRLPAPDRRLQDGSRHRPARRRYHHANRPAPRLPPHARSPQGRLLAGGRAERGRCLDREMAGADPI 
PEPQLRGVSQLWAEDASRRRGARLGALASLLLLPAQGADVHLQYRLPIRTRRRIMRMNITSVALMWLLAAQLAQADDPIN 
VSKTGTVLSDEVLYSIGGGSAVSMGSAGQMDSIGVGFGWNNDMMCGNMNLSTTLENQLNGATQGFQNIMGSVIQNATGAV 
MSLPALIIQRANPQLYNLITNGILQARIDYDRSKGTCKTIAEKMADIAGEQTGWGKIAEGQALGATLASDGKDAVSALEA 
VEKKGGNDGVTWVGGDKAGGSGQKPIRIVin)OTRAGYNLLT^^ 

EQQQQTCEGCQKTVTAAGVGLTPLIQETYDKKLQSLQELLSKSKPLTAENLAAAGTDALPITRGVIEALRDERDQDVLAR 
RLASDVSLMDVLSKALLLQRLMFAGAKEPNVAANGLATQAVDQQTSLLQQEISNLKTELELRRELASNSPMRVIERGQQR 
ASGSSGVFESAPDADRLDRLQAPSAAGGKSGGRP* 

>ORF5899 (SEQ ID NO:53) 

SAILVAPRSAGSPAPLGTFALAPPSRWCRTFSAHWTPLAGGMEFPSRCTPKRWSQGCARWVESSPATCGGTSIRAAASCT 
RPTTTRRQPSSPSAPAISPRESASSTSTSPCAQPPRTATGRRAS* 

>ORF6325 (SEQ ID NO:55) 

ASTARCFPTLGRRRKPSTGSTPGRSGVPTPAASARGRCSSAVPTSNKDTETNHANEHHLGRANVAARSATCPGRRPDQRV 
QDRHGAQRRGPLQHWRRQCGEHGQRRPDGLDRRRLRLEQRHDVRKHEPEHHPGEPAQRCHTGFPEHHGLSHPERDRRGHV 
AAGVDHPAREPSALQPDHQWHPAGADRLRPLERDLQNDRRKDG* 



Fig. 4-3 




>ORF7567c (SEQ ID NO:57) 

QCLAEHVHQGDIGRQAARQDVLVTLVAQRLDDAAGNWQSIGAGRSQVLCSQWFALRQQLLQRLELLWGLLDQRGEADAS
SRHRLLAAFAGLLLLLPQYPGGECGGLLGGGPSVADQAVVASGGRHARRIIHRAAGQQWARPGHWDDANGLLAGAAGL
VSTNPGYAIVAAFLLHCPEGGYGVFPVRGQCGAQGLAFGDFPPAGLLASDVSHLFGDRFASPFRAWVDPRLQDAIGDQV 
VELRVRALDDQRRQRHDRAGRVLDD* 

>ORF7180 (SEQ ID NO:59) 

FVERAFRHLQQRPGLQHLVLPPGGRRIRHPGTGGATATDLRRLPEDGDGCWRRPHPADPGDLRQEAPVAAGAAVEEQTTD 
CREPGCGRHRCSANYPRRHRGAARRA* 

>ORF7501 (SEQ ID NO:61)

PGRPGAPPGVRCLPDGRAQQGTATAAPDVRRRQGAQRRRQRPGHPSRRSADQPPAAGDLQSQDRTGTPSRVGQQLPHAGH 
RARATTRLRVQWRVRVGARCRSPRSPAGPLCRRRQVGRETVMADTLTTRKLLGQLLVGVLIVIGLAWGTLLSLFALNHF 
GGIQGLEAWRQSNYWSLFAWRALLYCALAIAWFRQRKELSAHERQRIRRIEILVLLLVLLIEFSKAYFRTGGAA* 

>ORF7584 (SEQ ID NO:63) 

CSPAPRSPTSPPTAWPPKPSISRPASCSRRSPISRPNWNSVASWPATPPCGSSSAGNNAPQGPVACSSRRPMPIASIACR 
PPLPPAASREGDRDGRYAHHPKASRSATGRSADRHRTGSGRYAAQSLRPEPLRWHPGPGGLAAKQLLELVRLAGAAVLRP 
GHRLVPAAQGTERA* 

>ORF8208c (SEQ ID NO:65) 

RSCCASRAEVGFAEFDEQDQQQHQDLDPPNALPLHRAQFLALPEPGDGQGAVQQRPPGEQAPWALPPGLQALDATEWQ 
GEETEQRTDHCQSDDDQHSDQ* 

>ORF8109 (SEQ ID NO:67) 

AAAHSADRDPGAVAGPAHRIQQSLLPHGRRSMTFMTNDYLEYYLTLLGWIINNGIWNMISDTGLFAVPFAAIVMREWLKV 
RGEGADEGNKGVLSLARIETHIYVGYIWALAGIPWNVSFDTIEFDQTRAQQCQYNLPAPADTGWSSSFSSLAGKSAQM 
PLWWAHMHALSKGFTSGAIAAIPCGTDLRQMRHEVDNTRVNNPLLAQEIADFSRDCYGPSRARLFMRQPDLGSVAEDNKA 
LQDLNWIGSRFLLNTPGYYDTDYSKSPRQSWPYNATRDAGLPQVGGGGGYPTCKQWWADSGIGLRDRIKDQVDPDLMTSF 
LKWAKWLNQDEVTEAVIRQVISPSSQVKGNVHDYGGQVGGTVWG 

LKMAMVICIPmVIGTYQLKVAMTMTVVFFAMMFVDFWFQLARYIDSTILDAFYGSGSPHLSFNPVMGLm 
FVMGSMFIVLPLLWHTAIGWSGIQAGSVLNGLSRGTEGVQAAGKEAGNRVKNAV* 

>ORF9005c (SEQ ID NO:69) 

VSPPLLAGWVAATTAHLRQAGIAGGWGPRLTGTLRWGVWPRGVQQESGADPVQVLQRLWLGDGAQVGLPHEQPRTG 
RPVAVSGKISDFLCQQRIVHARVVHFHSHLPQIRAARNGRDGAAGEALGQGVEHRPPERHLRTLAGQAAEGARPAGVRRC 
RQIVLAILGASLVELDGVEAHVDDRDPRQGDHDVADVDMRLDAGERQHSLVALVGAFPTNFQPFAHHDGRERHREQASIR 
DHVPDPWDDPAEEGEVILQWIGHEGHAAPPVRK* 

>ORF8222 (SEQ ID NO:71)

LPGVLPHPPRLDHQQRDLEHDLGYWPVRGAVRGHRDARMAESSWGRRRRGQQGSAVSRPHRDAYLRRLHRGRPGGDPGRQ 
RELRHHRVRPDSRPAVPIQSAGTGGHRLVELLQQPGRQECADAALVGDDARPVQGLHQRRHRGHSVRHGSAADANGSGQH 
AREQSAAGTRNR* 

>ORF8755c (SEQ ID NO:73) 

QSLEKSAISCASSGLFTRVLSTSIRICRRSVPHGHAAHAPLVKPLDRACIIAHQSGICALLPARLLKELDQPVSAGAGRL 
YWHCWARVWSNSMVSKLTLTTGIPARATTM* 

>ORF9431c (SEQ ID NO:75) 

LKPEVDEHHRKEDDRHRHGNFQLIGADDQDHRNADDHCHLQERHHHRQCLADHIHRREVCQAAHRNAEGSCGSRDAVPHG 
AAHLPAVIGVDVTLDLAGG* 



Fig. 4-4 




>ORF9158 (SEQ ID NO:77) 

RLHRLRRAGGRHRVERHRENRRNLRRCGGQLGILPGDGYGPPGTADGDVVPEDGNGHLHSDGPGHRHLSTESCHDDDGRL
LCDDVRRLLVSVSQIYRQHDT* 

>ORF10125c (SEQ ID NO:79) 

VIAGCLPLGMRLMIAHTNKGFASRIGFGLGMLVRFCLHDRRPALRWVKRVSLFLLVALWSQNFMWLAGVSMTLLCVF 
LVGFALVKGDISVSKGSPSRDVSTMTSQAETESVAELFDYQAAHHYRD* 

>ORF9770 (SEQ ID NO:81)

SNSSATDSVSACEVIVETSRLGDPLETEMSPLTKANPTRKTHSRVIDTPASHIKFCDTTRATNKNRLTRLTQRRAGRRSC 
RQKRTSIPRPKPIREAKPLLVCAFIINLLAPKGRHPAITYTPKKMIWQALWHIMPLAICRLEYLMATRNWLPDPLEQDI 
NELVETGRYQNRSEVIRAGLRLLLQQEAQIAKLETLRNATSSGLMQLERGEYDEITSDELAQYLDELGNQASH* 

>ORF9991 (SEQ ID NO:83) 

SWTAIMQAETHKHTQTKTDPGGKAFVGVRVHHQSPGSQREASCYHLYAEKDDLASIMAYYATSYLPTGVPHGNAKRRPSR 
SAGAGYQRAGGDRPLSESQRSHPGRLAPAAATGSPDRQARNPPQRNIQWADATGARRVRRDHQRRTGPIPRRARQPGEPL 

KHGQVPHLS* 

>ORF10765c (SEQ ID NO:85) 

HLVCRHPVEDEVPGPNNLTDIGHRVAVNEVDAAQASSQFFTADAAYLLWVCRNCFQRRPYECLVTSARGIAEVVVGEAQD 
IDDVRLGIMRDAVLGHASVARLVAELVEVLGQFVAGDLVVLAALQLHQPTGCCVAEGFELGYLGFLLQQQAQACPDDFAA 
ILIAAGLHQLVDILLQRIGKDDVSRCHEVLQSADS* 

>ORF10475 (SEQ ID NO:87) 

SMAKYRISHDAQADIVDILRFTHNHFGDAARRRYQALIGAALEAVATDPQQVGSISREELGAGLRSIHLVYCHSMPNVGK 
WRPRHFVFYRVATDQVLEWRVLHDAMDVDQHLPQR* 

>ORF11095c (SEQ ID NO:89) 

SRMQAWSTNANAWSGGMQSSGQATAIAHQPWGTCWWMFTRSLFVFAGANAALSAFRQALSGRAFTLVNHSLRPSSPFPL 
WAICSCYSCSSLGQVLIHIHGWKHANHL* 

>ORF11264 (SEQ ID NO:91)

TAVRRDLLKLMGCTHIEADYIGGLRCSTAPEGTWVAHGFHGPIVDVIDDSAGFFSTHRLALHYPAQCGLAVDQAIPRTAI 
HVASPLMHVCIGKVWISAWMC* 

>ORF11738 (SEQ ID NO:93)

EBVIMKLQAyRLQNYRRLRDWIELDDEISIFVGANNSGKTSAVQGLYSMLRGEVKKFELFDFSAALWAEIDAVGRTPPG 
DEDAPKRLPSILLDLWFRVGEDDLATAMSLLPSTEWDGKCVGIRVAFEPRDMELVWKFHELHEKANNAAVALAAKRKAA 
GEQAVEAGAEDAAAWADAGEYKPWPESLTKYLTKELSKEYTFRYYVLDERAFVGYQAREADYEPLPLGKEPGGAAILKS 
LVRVDFLRAQRHLDDPDAGSSDRAESLSRRLSRFYHRNLEKRGDDHAALKALDTSEKELNFHLKEVFNDTLTRLAKLGYP 
GVNNPEIVIRAALDPTTVLGQDAKVHYVIPGVASAQLPDSYNGLGF 

PEAHLHAQIQQVFIRmRLLEDANDHATLFHTQLVITTHSPHILYERGFSPIRYFRRVMJQLGHHTDVmSLFKTGAS 
DAPAREFLQRYLKLTHCDLFFSDAVILVEGNVERLLLPAMIELVAKRLRSSALTILEVGGAFAHRFQELIAFVGLTTLVI 
TDLDSVTVKTDAEKAAAQGAGAEGAVDGDDEDEDDDLKPFELEDDDEAEPSGKKKSKKRGSTCHAHVEGAVTSNQTLISW 
IPKKRSMAELWEVTAEQKTLSLAEDSSAGVRVAYQTKVSVTVGATTSQLCGRTLEEAFGLENADWCQAEANRSVGLKLKR 
APSSPEELMKLHDRWGKNFDKTRFALEVLASGPLNGWKVPAYIAEGLAWLEAKVAHELEADAAIATEVATIEPTTADV 
VAIIVDPGQTA* 

>ORF12348c (SEQ ID NO:95) 

RKVYSLLSSFVRYFVRLSGQGLYSPASATTAAASSAPASTACSPAALRLAASATAALLAFSCSSWNFQTSSWASRGSNAT 
RIPTHLPSHSVLGSSDIAVARSSSPTRNQRSKSMDGNLLGASSSPGGVLPTASISAHSAALKSKSSNFFTSPRSIEYRPW 
TADVFPLLLAPTKIEISSSSSMTTSRSRR* 

>ORF12314c (SEQ ID NO:97) 

GTSSGFLAKACTRRHRPPPQPRLPRPPPQLAPRRPCAWPQARQLRCWPSHVVHGTSRRARGHPEARTLPGSRRTCRPTQC 
SAAATSQWRGRLHRRGTRDPRVWTVTFWAHPHRQGASCRPHRSRPTAPH* 

>ORF13156c (SEQ ID NO:99) 

RQIAHIRVMAQLWDAAEVPNGRESAFIEDVRGVRGDDELRVEQSRVIVSILKEAQNVPDEDLLDLRVQMRLRLLNEDQM 
KRSSLVILGFPLLVQVEQLNHHVDQILEPQAIVAVWQLGGSYARDHVVNLGVLPQDSGRIQGRPNHDLRIVDARIAELGQ 
AREGVIEDFLQVEVQLLLRGI*

>ORF12795 (SEQ ID NO:101)

LPPNCQTATMAWGSRIWSTWWLSCSTCTSSGKPRMTSELRFIWSSLRSLRRICTRRSSRSSSGTFCASLRMLTITRLCST 
RSSSSPRTPRTSSMNADSRPFGTSAASTTSWAITRMCAICRYSKRARPTLQRANSCSGI* 

>ORF13755c (SEQ ID NO:211) 

ATRTPALESSASDSVFCSAVTSQSSAIDRFFGIQLMRVWLDVTAPSTCAWQVLPRFLDFFLPLGSASSSSSSSKGFRSSS 
SSSSSPSTAPSAPAPCAAAFSASVLTVTLSRSVMTSWSPTKAISSMRCANAPPTSRMVRAEERRRLATNSIIAGRSRR 
STLPSTNITASEKKRSQCVSFRYRCRNSRAGASDAPVLNSDRLRTSV* 

>ORF13795c (SEQ ID NO:213) 

CRRTfflUIRNLGLVSYPNPSAGILSQRQRLLLRRDFPELCHRPLLRDPADEGLVGRDGTFHVCMAGAATLLGLLLATRFCF 
VWFKLEGLQVWLVLVISVNGALSACALRGGLLGVRLDRHAVQIRDDQCCEPNEGDQLLEPMRERTTDFKDG* 

>ORF14727c (SEQ ID NO:215) 

QEVGELKDVLVAKYALGWTAHAWERPDAGHSLQASDISLLVGLVPVARGLPDTRAVILELFFKFGDPPADWLQPNLD 

VGLERL(^PmVDGRDMNQRVIVDVRQDFIGGKVGIRDARDYLLPRPHAGAVLRDHPVQRLDEGGGLAGTCASTNHEG 

LRRRRYNACVDLAVGVGIWAINSSAHAVCPGSTMIATTSAVVGSIVATSVAMAASASSSWATLASSQAKPSAMYAGTFQP 

LSGPLASTSSAKRVLSKFLPTTLSCNFSASSSGLLGARLSLRPTDRFASAWHQSAFSRPKASSSVRPQSCDWAPTVTET 
LVW* 

>ORF13779 (SEQ ID NO:217) 

RWVRRHHSSAAAHLRRPLVLRTRTGARLRQTGRSASSSSAHRAALKSWLRSYTIGWSARTSTRPALRWRYSQAGRSMAGR 
FPRTSPRAWPGSKPKWPTSLRRMLPSPPRSRLLSRLQPMLSLSLLTRGRRHEQTN* 

>ORF14293c (SEQ ID NO:219)

GRWSCRNLRQHEPRRAAEASLQCMRGSRGRRRYLGYQFVCSCRLPRVNNDSDNIGCSRLNSRDLGGDGSIRLKLVGHFGF
EPGQALGDVRGNLPAIERPACEYLQRKAGLVEVLADHPIV* 

>ORF14155 (SEQ ID NO:221) 

PGADGMSRRIDSPDTDADREIHACIVATPPQPFWRAGAGSGKTTSLIKALDWVISEHGASMRARKQIVACITYTDLATN 
EILADV1TODPLVHVSTIHSFYWSIAKTFQADIKVWLQNDIRRRISELEEEFENYSSRVRQTTRDRNKADQERYVRSLEAV 
AGVRTFNYGVGSDYAKGILGHEDILQLADFLLQNRPLFRRWALSYPFVFIDESQDTFPGWKSFKEVEAQMQGKFCLGF 
FGDPMQSIFMRGAGDIQLEDHWRAITKPENFRCAKQILDVANAVRAQGDGMEQVRGLHERVDGNLKLVEGSARMFVLPNT 
IMTEALMVRAWSSAT1M)EGWTTPDIAVKILVIVHRMAAN^^ 

LPIVAAVKAGNEFAAMSLLREFSPRLAPAALTGRRAADVLRELHAAASRLVAMLDEAGTTIGDIALHLCDTGLFEFDERY 
ARVLGFVRDIADTAQEPEAADAVPAEGLSLDATMAKFFNCSAQELWPYERYVSEGSPYATQHGVKGAQFERVMVVMDEEE 
SDYRTYNYERVFASAEARAADRARALDGDENTWSRTLRLLYVCCTRAQRGLVLAFFVADPATTLENWASGILPRSAVFT 
QEVLVGWP* 

>ORF14360 (SEQ ID NO:223) 

SRASRIPTLPPMKSWRTSTMTRWFMSRPSTAFTGLLQRRSRPTSRFGCRTTSAGGSPNLKKSSRITARVSGRPRATGTRP 
TKSDMSEAWRLWPASGRSTTAWAVTTPRAYLATRTSFSSPTSCYKTARCSDGSWR* 

>ORF15342c (SEQ ID NO:225) 

EGSNGPTGAVLHPLLHGIRRLWQRRVDAAEAKPVCGHAVHDDKNLDCDVWGCPTLWVRRRAPRSDSCQSLGSVQRVRQ 
DEHPGRPLHQLEVPIDPLVQPADLLHAIALRAHGIGDVKDLLGAAKVLRLRDGPPMILKLDVPCASHEDRLHRVAEKTKA 
ELALHLGFHFLERLHYTRERVLTLIDKHERVAQRHDPSEQRAVL* 

>ORF15260c (SEQ ID NO:227)

MPPKPSRFAAMRCTMTRILTAMSGVVQPSSLFVAELHARTLAKASVRFSVFGKTNIRADPSTSLRFPSTLSCSPRTCSMP 
SPCARTALATSRICLAQRKFSGFVMARQ* 

>ORF14991 (SEQ ID NO:229) 

RRQCRARAGRWHGASPRAAREGRWEPQAGGGVGPDVRLAEHAEPNRGFGKSPSVELGDEQRRGLDNPRHRSQDSCHRAPH 
GRKPAWLRRHLLGAERQDVGCHEARDAGRHRLARSTLPKFCATDRCSCEGRQ* 

>ORF15590c (SEQ ID NO:231) 

RSSNSKRPVSQRWRAMSPHVVPASSSHATSLDAAACSSRNTSAARRPVRAAGARRGLNSRSRLIAANSLPAFTAATIGSA 
KLRKGRTGQPVPSCIPCFMASDVLSFSAE* 

>ORF15675c (SEQ ID NO:233)

SFGRNCISSLGLLSGVSNIPDKPKNTRIALVELKKTRVTEMESYVTNGGPCLVQHGDKPRRSSVQLSQYIRGTSAGQSRR 
RQARAEFPEQAHRRELIAGLHSCNDR* 

>ORF16405 (SEQ ID NO:235) 

IDSLRKCVGSLEKCCFACKEIIHVHAIRCRQCGESQGWRRFMSSPTSWALVLSLLSIAATKPVERLFDAQRAELQISIT 
GGDYKAAQLMLTNNGSKPATLVSFEITSKATTNTKTWFLVSNTDGEILEPGKTYKIRASTDESIPKIVEAERRTILKSQY 
ALADNCELTAKYIEATGQKWRVQPFMCDTPPEKGGLPPGKPGIPIWYLGQE* 

>ORF16925 (SEQ ID NO:237)

RPRGRRLOTRSCATHLLKRVACPLWIAYPFGTLV^ 

WPYAYEVNSESVQWTIFLLGVDCSGKVIYFRNTARV6PFLAASIYRPWYGSDALVLHFTK* 

>ORF17793c (SEQ ID NO:239) 

AKMIVIDOLEHLVAQCAICEKTLFDEFSLKIQLGHTYYEPKSLPASASIVYGSHPAPSTFFLEPKEIQQNLVLKSGEQV 
ITCSKHRYKIPLDYFGLVQTKGTLARLFVQVTCNDGQVEPGFDGYVTLEIVNMSPWTIEIPAVSDIAQLYLVKCSTSASE 
PYHGRYMDAAKKGPTLAVFRK* 

>ORF18548c (SEQ ID NO:241) 

RTMAGWPRLAAQGRRTNLMSVLQIKGRTTKSHTDFDAASYSSNSLILTDAGDERIEEFSLELSVGEGWSDNYSGNDKNLW 
RIVDGMTIRGHDSVWEAAEEIKVPHNRYGIVLPTGSLFLSRGVLVASAKVEPAFDGKLKLRIFNTTNKHVCLTKGEKLG 
SVIFFSTESTHTQSPIKRGSEISTLPITRRARLKKWFSLNPTIWVGWTLNLIGSSLVSSLIMYAVYYKWLEHQSQPPQS 

QQNAQPSPNEVKPK* 
>ORF17875 (SEQ ID NO:243) 

TAYIIREDTRELPIKFSVHPTHPGLSENHFFNRARRVMGSVDISLPRLMGLWVCVDSVEKKITEPSFSPLVRQTFLLVV 
LNILSLSLPSNAGSTFAEATSTPRERKRLPVGRTMPYRLCGTLISSAASTTTES* 



Fig. 4-8 






>ORF18479 (SEQ ID NO:245) 

SVTHSSDLSPVLGLRDAATLPLSFIPADIPGYRLKDDVRKACTNLNFKRLAVIVGERERHRPYITWRQHTGTERYPASEQ
RASRKKKRRQIFRQIEFFHGARQISLARFHDEAVIRVCEHDLAGRGASRRFSQASTPYCQAREACESEVKSNAFRGGQLT 

VGKVLD* 

>ORF19027c (SEQ ID NO:247)

MIYSPHSLLKLVRDGKLIKHLAHRELTTPEGVGFDLRLAGLSRLTVGGGSLRESTRRTPASEWLADPDDCFVMEPGKTY 

LASTMEEFDLPEDLAALFFPRSTLFRSGITFSSSVLPPGYVGPMTFALTNNHSEAFEIQIGARFAHVIFQAVSGDIGRYK 
GQWQGGRVSQPKDEGQI* 

>ORF19305 (SEQ ID NO:249)

WPFSACRLFGMTGQVGCKRWSAPMQLGGHVRCNYAVEPGPVPPKQSIRPRWHIANKIPFPATWLSLLPALIWRKSPLHE 
SSWSLPCFNSFPGYPGSRPPPQQPKLPQGDSSFL* 

>ORF19519 (SEQ ID NO:251)

SGGKARSMNRHGASHVSTPFLDIQEAVPHPNNQSCPRGIHPSSEQHGTARHASPPAATGEHLAARLAIQAAIRGDLPAAT 

GQLCRAGPAAPCFGKSSPCPSRRDDRSRPGDRGLRTQGTADLPAPDRRSAGVTVSPG* 

>ORF19544 (SEQ ID NO:253)

IVMEPPMPQLLSWISRKPSPTPTTKAAPGGFILPLSSMELLGTPRRRQLLENIWQRASLSKQQPEEIYRRPLANYAELVQ
QLPASENHHHMPGGMIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIVVDLQVELQDGS 
TWHPWNGPINQPYRPKYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQA 
SVAQELGGNPDRALMPKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPS 
SNAPFFSMLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLWEDGTASTEKPETTC 
EIPNGPAEQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRG 
EENLQQPLGTKEPTDCAPEAIEDVFlJPSRSTDLGQGFVGWMSGIAARRLFIiroTKALVHTVDGTAMLVTPGIFKRYVQE 
HPVLEKLAQAKETTGMLVQRAFEKQGLHRKTSKNLNIVraiKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDA 

EGGVE* 

>ORF20008 (SEQ ID NO:255) 

ATGRQHLAPLERTDQPAIPLQVREVPRIPAPRRCLSTSHPPTATAHCTRLAQSLSRAVGSIDLPVRWAVRARRDPRRDHR 
EGRPGLSCTGARRQSGSSSGCTEAVAAAAVGRRPSLLGEGQVQVEST* 

>ORF20623c (SEQ ID NO:257) 

RELVSPSSTVVYRGRPNGLVLGICLDDGLVLEHAEERRWGGHPIDTLGQQVGSQLIGSRIAHQPECVLGQPSIRRAARL 
IQLELVLHQEAKAVCQLPLQRLLRCSQSSIRIAS* 

>ORF21210c (SEQ ID NO:259) 

RLKIPGVTSMAVPSTVCTKALVSLMNRRRAAMPDFIQPTNPCPRSVLLLGINTSSIASGAQSVGSLVPSGCCRFSSPRVF 
VGEAAGESWLVSSSSSGELILPRSAYKSSVSS* 

>ORF21493c (SEQ ID NO:261) 

AAAASFHTSFGIGDDREAWWQRLLREQQFGILEQVGLELFRLARTRNLDGPDVQVFTGLPMKPLFFERALHQLPAGRLL 
GLGQFFKHRMLLDIALENSWRDQHGGPVYGMHQSLGWDEQATGRDARFHPTNESLSQISTSARHKYIFNCFRSAICWLL 

GP* 

>ORF21333 (SEQ ID NO:263) 

TSGPSRFLVLARRKSSRPTCSRIPNCCSLSSLWTTQASRSSPMPKEVWNDAAAAHRGVHLRARSPGSQREDLPRRDQGAA 
QALRSYGNRTGRGPPGCPGMATQGTGTRPVQAELEHVLESSADDLGLCHRA* 

>ORF22074c (SEQ ID NO:265) 

VRHKPLRYWHYELFVSLLCLAADEDQLIFPVDVALADAQQRIQPDAGEVEDLKRAKPEPGGRDPCAFAGAALCAVLHVEP 
IAGEQDGFGGDGFACSPGGNDGGLSERVDLGVRHQLMLDGIAPDRPQMIRVRVPAPLGQALFQYLASPSQDSPWHVLYG 
CRRTEVLEQRLGRGAVDLRAGFPEIVREDVLLGELLRRHSTPPSASVMTVRLGLSRGCSGNSNLGSWSR* 

>ORF21421 (SEQ ID NO:267) 

AASGQPKPHGIfflRCRRRCGMTPQQLTEEYIFAHDLREASAKIYRAATKALLKHFGPTATVQDVDHRAVLGWRRKVLEQGL 

SKRSMTYSlfflLRTIWGYAIEHELVTHSQVNPFRKTTVIPPRRASKTVAAEAILLARNWLNMQDGAERCTGERARITPAW 

FWLCTFEVFYFTGIRLNALLCIRKRDIDWENQLILIRGETEKTHKEFWPITEGLVPHLSRLLQEADRAGFADDDQLFNV 

NRFSPETKSKVMSDQVEAMYRKLTEKVGVRMTPHRFRHTLATDLMKAPERNIBITKCLLNHSNIQTTMSY 

AVLHARSLAQGALENVRKVDYSGSPQASAKPKPCGQPLARVSEAPPPEARTEPAEPREHTPGTGIQGGPTAWEADALPQP 

PDTFEPSVLFTLMAQNLSNRAASASAAPAATSGSGGWGSAARSNLA* 

>ORF20008 (SEQ ID NO:255)

ATGRQHLAPLERTDQPAIPLQVREVPRIPAPRRCLSTSHPPTATAHCTRLAQSLSRAVGSIDLPVRWAVRARRDPRKDHR
EGRPGLSCTGARRQSGSSSGCTEAVAAAAVGRRPSLLGEGQVQVEST* 

>ORF20623c (SEQ ID NO:257)

RELVSPSSTVVYRGRPNGLVLGICLDDGLVLEHAEERRVRGGHPIDTLGQQVGSQLIGSRLAHQPECVLGQPSIRRAARL 
IQLELVLHQEAKAVCQLPLQRLLRCSQSSIRIAS* 

>ORF21210c (SEQ ID NO:259)

RLKIPGVTSMAVPSTVCTKALVSLMNRRRAAMPDFIQPTNPCPRSVLLLGINTSSIASGAQSVGSLVPSGCCRFSSPRVF 
VGEAAGESWLVSSSSSGELILPRSAYKSSVSS* 

>ORF21493c (SEQ ID NO:261)

AAAASFHTSFGIGDDREAWWQRLLREQQFGILEQVGLELPRLARTRNLDGPDVQVFTGLPMKPLFFERALHQLPAGRLL 
GLGQFFKHRMLLDIALENSWRDQHGGPVYGMHQSLGVVDEQATGRDARFHPTNESLSQISTSARHKYIFNCFRSAICWLL 
GP* 

>ORF21333 (SEQ ID NO:263) 

TSGPSRFLVLARRKSSRPTCSRIPNCCSLSSLWTTQASRSSPMPKEVWNDAAAAHRGVHLRARSPGSQREDLPRRDQGAA 
QALRSYGNRTGRGPPGCPGMATQGTGTRPVQAELEHVLESSADDLGLCHRA* 

>ORF22074c (SEQ ID NO:265) 

VRHKPLRYWHYELFVSLLCLAADEDQLIFPVDVALADAQQRIQPDAGEVEDLKRAKPEPGGRDPCAFAGMLG^VLHVEP 
IAGEQDGFGGDGFACSPGGNDGGLSERVDLGVRHQLMLDGIAPDRPQMIRVRVPAPLGQALFQYLASPSQDSPWHVLYG 
CRRTEVLEQRLGRGAVDLRAGFPEIVREDVLLGELLRRHSTPPSASVMTVRLGLSRGCSGNSNLGSWSR* 



>ORF21421 (SEQ ID NO:267) 

AASGQPKPHGHHRCRRRCGHTPQQLTEEYIFAHDLREASAKIYRAATKALLKHFGPTATVQDVDHRAVLGWRRKVLEQGL 
SKRSWNTYSmRTIWGYAIEHELVraSQVNPFRKTTVIPPRRASKTVAAEAILLARNWLNMQDGAERCTGERARITPAW 
FWLCTFEVFYFTGIRLNALLCIRKRDIDWENQLILIRGETEKTHKEFVVPITEGLVPHLSRLLQEADRAGFADDDQLFHV 
NRFSPHYKSKVMSDQVEAMYPJCLTEKVGVRMTPHRFRHTIATO^ 

AVLHARSLAQGALENVRKVDYSGSPQASARPKPCGQPLARVSEAPPPEARTEPAEPREHTPGTGIQGGPTAWEADALPQP 
PDTFEPSVLFTLMAQNLSNRAASASAAPAATSGSGGWGSAARSNLA* 

>ORF22608c (SEQ ID NO:269)

RICFPRGWTSLNACPWRVLPWFCRLCPGLRWRRFTHSSERLPAWLRFGRGLRGAAVIHLPDILQRALGQASSMQHGTHVI 
WGLDVAHRGLDIRWEQALREVNVPLGCLHQVGGQGVPETVRGHPHPNLLGQLPVHGFDLVGVHHLALWR* 

>ORF22626 (SEQ ID NO:271) 

HLRTKRAVHSDGSKLIEPCRLGIRGSRCNKRIRRMGICRPKQSRLAIPVLRAGYRTKGSRAFQQIVRPVGKIWNYRERLD 
SSAGMLAEPAQFQGQYHSTLCNR* 

>ORF23228 (SEQ ID NO:273)

RrSNSRHPAPKAGALPDCAIPRLEFGSATWTRTRDPMINSHLLYRLSYRGTSPFQPWTLPVLLDSRLRGAPPYGCARACQ 

PSDPKSFSSFSTSDKTALPLHAAALSRLPDAHEKAPPKRGPPCPPPKRSGEDDLVAFHLRRDTGTRREFAGQDQLRQRVL 

DPALDGPLQRACAIDRVEADGNQLVQRLLAQFQAQLALGQALAQATELDLGDAGDLLASQRLEHHHFVDPVDEFRTEVRI 

DRVHHCGTLRLAVAGQLLDLRRTEVGGffiJHHGVAEVHRT^ 

GQVAAFLEADVARRRADQAGHRWLHELGHIYPHQRLLGIEEELGQRLAQLGLAHPGRAEEEERAARPVRIGEAGARTAH 
GVGHGDYRLVLADHSPMQLLLHAQQLLALALEHLRHRDTGPLGNHFGDFLVGHLVAQQLVLGLAVLVDHLQAAFQVRDGL 
VLDARHALEVALAPRRLHLLLGLLDLLLDLRRALHLGLLGLPDLLEVGVFALELDDILLQLGQALPGGFWFLLQRLALD 
LQLDQATVETIQFLRLGVDLHADAAGGLVDQVDGLVRQLPIGDVAVRQLGRGDDRAVGDAHPWHFIAFLEATEDGDGVF 
LARFVHQHLLEAALQRGILLDVLAILVEGSSTDAVQLAARQSRLEHVAGVHGTFRLAGADHGVQFVDEQDDPAFLLAQFV 
EDRLQAFLELAAELGTGDQRPHVQGQQALVLEAVRHFAVDDALGQALDDGGIADAGFADQHRVVLGPPLQDLDGPADLVV 
ATDHRVELAFLGALGHVDGVLVQRLARLLDVRVVHRFAATQVGHGILQRLARHALAEQQLAEPGVLVHRGQQYQLAGDEL 
VALLLGQAVSLVEQACEILGQVHVAGRALDLRQRVEFFVEAAAQGGDIEADLHQQGLDRTALLLEQGGKQVHRLDGRMVM 
ANGQGLGVGERQLQLAGQTVYSHGSSFLL* 

>ORF23367 (SEQ ID NO:275)

AIAERLSSNPGRFRCCWIRVSEARHFTDARGHVNPLIQKVFLLFPRATKRPFHCMRQRSRAYRTPMKKPRRSGAFPVRPR 

RGQAKTISSPSTFAEILAPGANLPARISCASGFSIQRWMARFSGRAP* 

>ORF25103c (SEQ ID NO:277) 

SAPRGEHHRRRDHRRGQAVAPLHHRSATAGQGHRPDRRGRQPHPHGDRLQAGGTGSSRPSPDPAEDRARGAEEGRRRSHQ 

EAPGQAGGGYRQARARIRRPRGDLEVREGRGAGLGADPAEDRAGQAGDGGGAAQGRPREHGAHPVPDHPGPGTQPADGRP 

ARQDREPVAAQQGDRRGNRRSGFQVDRYPGVEDARGRAREAAAHGAGAASASDRPGRGGSRRVQRRAPFARRPRRSEPAE 

RLVPLPRPDRGGQDRWQGAGRVPLRYRGGAGADRYVRVHGETLGGPPDRRASGLRRLRGRRLPDRGDPPQALLGGAAGR 

GGEGPSGCIQHSPPGARGRTPDRQSRAYGGLPQHRGGDDLQPRFGADPGAGRRPRGATCRSDGRGQCALPSGIHQPDRRS 

GGVRAAGSRADRRHRRDPARSPAQAPGRARAEPGTEPGGAGQADCRRLRPGLWRTPAEAGHPALDREPAGATDPGRQIRA 

GCQYLGEGGRRRDRLRLTSSGADRESPASAGLFHGRPVGARALPHAVEGPFCRSWKKKKNFLDQRVDMPARIRKMARL* 

>ORF23556 (SEQ ID NO:279) 

KSPAEAGLSLSAPEEVRRRRSRRLPPSPRYWHPARICRPGSVAPAGSRSSAGWPASAGVRHRPGRSRRQSACPAPPGSVP 
GSARARPGACAGDRAGSRRCRRSAREPAARTPPLRRSG* 

>ORF26191c (SEQ ID NO:281) 

KEGRPMRIDRLTSKLQLALSDAQSLAVGHDHPAIEPVHLLSALLEQQGGSIKPLLMQVGFDIAALRSGLNKELDALPKIQ 
SPTGDmSQDLARLLNQADRLAQQKGDQFISSELVLLAAMDEOTRLGKLLLGQGVSRKALENAVANLRGGEAVNDPNVE 
ESRQALDKYTVDMTKRAEEGKLDPVIGRDDEIRRTIQVLQRRTKNNPVLIGEPGVGKTAIVEGLAQRIINGEVPDGLKDK 
RLLALDMGALIAGMFRGEFEERLKAVLNELGKQEGRVILFIDELHTMVGAGKAEGAMDAGNMLKPALARGELHCVGATT 
LDEYRQYIEKDAALERRFQKVLVDEPSEEDTIAILRGLKERYEVHHGVSITDGAIIAAAKLSHRYITDRQLPDKAIDLID 
EAASRIRMEIDSKPEELDRLDRRLIQLKIEREALKKEDDEATRKRLAKLEEDIVKLEREYADLEEIWKSEKAEVQGSAQI 
QQKIEQMQEMEAARRKGDLESMARIQYQTIPDLERSLQMVDQHGKTENQLLRNKVTDEEIAEVVSKWTGIPVSKMLEGE 
REKLLRMEQELHRRVIGQDEAWAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVGKTELCKALAEFLFDTEEALVRIDMS 
EFMEKHSVARLIGAPPGYVGFEEGGYLTEAIRRKPYSVVLLDEVEKAHPDVFNILLQVLEDGRLTDSHGRTVDFRNTVVV 
MTSNLGSAQIQELAGDREAQRAAVMDAVNAHFRPEFINRIDEWVFEPLAREQIAGIAEIQLGRLRKRLAERELSLELSQ 
EALDKLIAVGFDPVYGARPLKRAIQRWIENPLAQLILAGKFAPGASISAKVEGDEIVFA* 

>ORF23751 (SEQ ID NO:283)

TGSKPTAISLSSASWLSSRLSSRSARRLRRRPSWISAMPAICSRASGSNTTTSSIRLMNSGRKCALTASITAARCASRSP 
ASSWICAEPRLEVITTTVLRKSTVRP* 

>ORF24222 (SEQ ID NO:285)

PGGAPIRRATECFSMNSDISIRTSASSVSKRNSASALHNSVLPTPVGPRKRNEPLGRFGSARPARERRTALDTATTASSW
PITRRCSSCSMRSSFSRSPSSIFDTGIPmETTSAISSSmLRSNWFSVLPCWSTICRLRSRSGMVWYMlAMLSRSP 
LRRAASISCLACSIFCWICAEPCTSAFSDFQISSRSAYSRSSLTISSSSLARRFLVASSSSFFSASRSIFSWIRRRSRRS 
SSSGLESISMRMRLAASSIRSMALSGSCRSVM*

>ORF24368 (SEQ ID NO:287) 

TRTYLSAPAPPRYRRGTRPAPCTTRSCPPRSGRGRGTSRSAGSDRRGRRANGARRWTRRLPPRPGRSLADAAPAPCAAAS 
RARPRASSTPGYRSTWKPLRRFPRRSPCCAATGSRSCRAGRPSAGCVPGPGWSGTGCAPCSRGRPCAAPPPSPAWPARSS 
AGSAPSPAPRPSRTSRSPRGRRIRARA* 

>ORF24888c (SEQ ID NO:289) 

RRKTTRPPGSAWPSWRRISSSSSANTPTSRRSGSPRRPRCRARRRSSRRSSRPSRRWRRRGARATSRAWRASSTRPSRTW 
NAACRWSTSTARPRTSCCATR* 

>ORF25398c (SEQ ID NO:291) 

RRSSTMASRWGSSCSSTNCTPWSAPARRKVPWTPATCSSRLWmSCTASVLLPSTSIASTSRIMPRWSAASRRCWWT 
NRARKTPSPSSVASRNAMKCTTG* 

>ORF25892c (SEQ ID NO:293) 

PPGPAEGRPVHLQRAGIAGRDGREHQARQAAARPGRVAQGAGECRGQPAWRRSGERPERRGVAPGAGQVHRRHDQARRGR 
QARPGDRSRRRDPPDHPGPAAADQEQPGADRRTRRRQDRHRRGPGPAHHQRRSAGRPQGQAPAGPGHGGADRRCQVPRRV 
RGTPEGGPQRTGQAGRPGHPVHRRTAHHGRRRQGGRCHGRRQHAQAGSGARRAALRRCYYPRRVSPVHREGCRAGAPLPE 
GAGGRTERGRHHRHPPWPQGTL* 

>ORF25110 (SEQ ID NO:295) 

RSLRPRRMAM7SSSLGSSTSTFWKRRSSAASFSMYWRYSSRVVAPTQCSSPRARAGLSMLPASMAPSALPAPTMVCSSSM 
NRMTRPSCLPSSLRTAFRRSSNSPRNLAPAISAPMSRASRRLSLRPSGTSPLMMRWARPSTMAVLPTPGSPISTGLFLVR 
RCRTWMVRRISSSRPITGSSLPSSARLVMSTVYLSSAWRDSSTFGSFTASPPRRLATAFSSALRDTPWPSSSLPSLVFSS 
IAASNTSSLEMNWSPFCWARRSAWLSRRARSWDRFTSPVGLWIFGSASSSLLRPLRRAAISKPTCISRGLIEPPCCSSRA 
ESRCTGSMAGWSWPTARDWASESASCSLLVKRSIRMGRPSFYRAGRNDGCP* 

>ORF25510 (SEQ ID NO:297) 

CAGPGPRRWRSCRRRVRRSAPGCSWSAAAGPGWSGGSRRRDRSPGRACLPRRAWSCRRCTCPAPGATPRRSGRSPLRRHA 
GWPRHSPAPCATRPGRAAACRAWCSRPSRPAIPARWR* 

>ORF26762c (SEQ ID NO:299)

PPTACRRCSATARAPGWPRPMPAGAGWRRACWRRRWTAWACPATNCWSGWGRRSARRPSRSAARSAHHSSLRTPRRARLS 
YLAPIRAASWPTSTDSRGSAWAPMASPPCMAAASAPSAIPRASIPTAARRVPAVLPAWSGSRTRPAQVIRRQLTDVTVRS 
LEPRKIALIY* 

>ORF26257 (SEQ ID NO:301)

IRAIFRGSSDRTVTSVSCRRITCAGLVLEPDQAGKTAGTRRAAVGIEARGIAEGAEAAAMHGGDAMGAQADPRESVDVGH 
EAARIGARYESRARLGVRSDECIADLAADLEGLRADRRPQPDQQFVAGHAQAVHRRLQHARRQPAPAGMGRGHPGARAVA 
EQRRQAVGGHDRTGDARHRAPAGVGPEHRFGSASTTSLRAPIPTSSAGIPDARSGVGGFIPRRADRRRRGRQGSDCRRVP 

>ORF26844c (SEQ ID NO:303)

RGGGRPEPVLRADASWSAMPGVACTIMTADCLPALFCDRSGTRVAAAHAGWRGLAAGVLEATVDSLGVPGDELLVWLGPA
IGPQAFEVGGEVRDAFVAAHAEARSAFVPSANPGRFMADIYRLARIRLGAHGVTAVHGGGFCTFSDTARFYSYRRSSRTG 
RFASLVWLQD* 

>ORF26486 (SEQ ID NO:305) 

MSAMKRPGLALGTKAERASACAATNASRTSPPTSKACGPIAGPSQTSSSSPGTPRLSTVASSTPAASPRQPAWAAATRVP 
ERSQNNAGRQSAVMIVQATPGIALQLASALSTGSGRPPPRHSVHLFQPARPAFQTLGQASAVLFHGARIWDVGAKVQTV 
EGCLADPATARGHAGPHTGRRRPVGGQPGVQPTNASRSWRNRLSSQRNSSGSGDSHFMRTPVAG* 

>ORF26857c (SEQ ID NO:307) 

VHGVTWWRPTRTGAQGRRQLEROAGRRLYDHDRRLPAGVVLRPLGHPGGRGPCRLARAGGGRAGGDGGQPGRAIiRRTAGL 
AGAGDRPAGLRGRRRGPRCIRRCARRGALGFRT* 

>ORF27314c (SEQ ID NO:309) 

SGNRRCRKNSSGCSACCARIARRSSVERLADPRLAGAGPCAGLRDHAQWRGQPGTLRQSEPWRPRLRRSARRGIKPPTPD 
RASGMPAELVGIGARSDWEADPNRCSGPTPAGARCRASPVRS* 

>ORF27730c (SEQ ID NO:311) 

QARRPGGPSGCRPSGRHPAECLALPCPGHRQCAARRDRPPPGQGHDRPDGSGQDAGGPHQAGGATAGTVGQPHLRGDRDR 
RDHLRRHHRCADRTAWRAAAEDGGGRRRQGGGQPLPRAGTLPCAHPYPGQAGDRAYPPDPRAHEPYWLSPGRRSGLRWAL 
QDSPGGQPDPGPDSSRIPPAGAARALPRTGSPGHRRAHEVGIAAAGRIPLAAQPVAPGSRGVRRLNAWLTPDWPAPARVR 
ACVTTRSGGVSQAPFDSLNLGAHVYDDPRAVE* 

>ORF26983 (SEQ ID NO:313) 

PRHCAWSRRPAHGPAPASRGSARRSTDERLAILAQQAEQPEEFFRQRRFPLHAHAGGRVIQFEEARVQRLPGEFAKSLDQ
GLAGHRGNPEAPTVDRIADQGIANMAHVHADLVGTPGLQLDPGMGVRTEAFQHAVMACRHLAGVDHRHLLPLHAMPS
IDGAAGGDHADHDRLVDAADRPCLQLRHQLGVGLQRLGHYHQAGRVLVQAVDDPGARHIGDVRDMVEQGIQQGAVLHAGS
RMDHQAGGLVNHQDVLVLVDDFQLDVLCEPLALGFLLGLQDQLRAAVDDVARAQHGAVDGQATVLDPAGQTGAGVFGKKL
GGDLVETLATQLERHLGRALNHIGHE*

>ORF28068c (SEQ ID NO:315) 

PQRVADSRSRAEHRLLLHSDHIQRAAEVPFELGGQRLDQIAAQLFPEHSRSRLAGWIRDGRLTVDGAVLRPRDIVHSGAQ 
LVLEAEQEAQGEWLAQDIELEIVYEDEHILVIDKPAGLVVHPAAGHQDGTLLNALLYHVPDIANVPRAGIVHRLDKDTTG 
LMWMTLEmKLVAQLQARSVSRIYEAIVIGVITSGGTIDAPIGRHGVQRQmVVDAGmVSmVLERFRAHTHT 
RVKLETGRTHQIRVHMSHIGYPLVGDPVYGGRFRIPPVASQTLVQTLREFPRQALHARFLELDHPATGVRMKWESPLPEE 
FLWLLSLLRQDREAFVG* 

>ORF27522 (SEQ ID NO:317) 

PTVPAVAPPAWCGPPASWPLPSGRSCPCPGGGRSRRAAHWRCPGHGRARHSAGCRPDGRQPDGPPGRRACQSPGCARPRR 
RFPARCPVRATRPGLPARPPGPVARRCGRCRAGAARRRRRSGDRP* 
>ORF28033c (SEQ ID NO:319)

ASSPTHVRYDSTRGRGAVRAGWPASRPDRRPAFSRTLPLPSGRLDQGRSPDRRRRRAAPARHRPQRRATGPGGRAGSPGR
VARTGHRAGNRLRGRAHPGD* 

>ORF29701c (SEQ ID NO:321) 

SSSSLEISRTSTRPMVRRYRWYRRRMCPCSSLSRSRSARTVALVLAQVRLAAIPALFVGEGVGLRHVDAAMGAADHRRC 
ARLVLRTLLLARSGTGKATPEPERDGDQGDPEQEAEKAHGDLGGWRKLQFSQAAGSIPDGKVQAVRRLALGEAWRRAKRR 
EACASLRCFVRSVEETEGHVAPPGATGVLWIALRLLWGAVILVFRLQPGGDLPLGILVLLDHVLGGLGPlMRRIiAAF 
DQAQGGLGQPGAGVGLAFAGDELAILEAGVIRIVQLEGFQAGAGQWETQATVGFDHDRQAIADGRGFLEVLHHVATAVG 
GGDIGLALQVVVADVHFVGRQQVAQVHHARLGVRGVAAVGEAAGELGELVEGVAGGARVALGHVQRQEARQQAAVLVEGG 
QAFEWGVVDVGVLRMQADEAFGGGAGGFGLHVLWGVDQLELGLLGVAAEGIARFEGFQLGDGAVVALVVEVVLRLLVQ 
LALAQVLVDSLLVRGAGCGEGEDGDQQQVFHLHGGLRPWDGRLGLNRLL* 

>ORF28118 (SEQ ID NO:323)

QTVEMPTWGTQAMQVKHLLLIAILALTMCSSNKETVDENLSESQLYQQAQDDLNNKSYNSAVTKLKALESRYPFGR 
YAEQAQLELIYANYKNMEPEAARAAAERFIRLHPQHPNVDYAYYLKGLSSFDQDRGLLARFLPLDMTKRDPGAARDSFNE 
FAQLTSRFPNSRYAPDAKARMVYLRNLLAAYEVHVGHnLKRQAYVAAANRGRYVVENFQETPAVGDGLAIMVEAYRRLG 
LDDLASTSLETLKLNYPDNASLKDGEFVARESEADTRSWLAKATLGLIEGGEPPPHMETQAAKDVIKQYEDAEREIPAEL 
KPENQDHSADDEKPESDDDEDSGRSWWSYMTFGLFD* 

>ORF28129 (SEQ ID NO:325) 

GQADRPRDASRHASETPAADRHPRPHRSLLLEQGDCRREPEREPAVPAGAGRPQQQELQQRRHQAESPRIALSLRPLRRA 
GPARADLRQLQEHGARSRPRRRRTLHPPASAAPQRRLRLLPQRPVLLRPGPRPAGALPAAGHDQARPGRRPRLLQRVRPA 
HQPLPQQPLRPGRQGAHGVPAQPAGGLRSARRPLLPEAPGLCRRRQPRSLRGGELPGNPGRRRWPGDHGRSLPSPGSRRP 
GQHQPGNPQAELSG* 

>ORF29709c (SEQ ID NO:327) 

GPDLPVRWRSAGPVPGRWSGGTGGTDGGCVAPAPRSAVAVQRARSPWSWRRCAWLQYQRCSWARALACGTWTPQWAQRTI 
GDALGSSCGRCCWRGVGRVKRRQSQNAMAIRAIQNRRRKRPMVISEAGESCSLAKPPARSQTGRSRLCGVWRWERHGGGQ 
KEGRPAPPFGVSCDQSKRPKVM* 

>ORF29189 (SEQ ID NO:329) 

SHETPKGGAGLPSFCPPPCLSQRQTPHSLDLPVWDRAGGLAKLQLSPASEITMGLFRLLFWIALIAIAFWLWRRFTRPTP 
RQQQRPQDEPSASPMVRCAHCGVHVPQANALAHEQRWYCSQAHLRQDQGDRAR* 

>ORF29382 (SEQ ID NO:331) 

SPSRSGSGVALPVPLRASSNVRRTSRAHRRWSAAPIAASTCRRPTPSPTNNAGIAARRTCARTRATVRAERLRLSEEQGQ 
RILRLYHLYRLTIGLVLVLLISSELEDQVLKLVHPELFHVGSWCYLVFNILVALFLPPSRQLLPIFILALTDVLMLCGLF 
YAGGGVPSGIGSLLWAVAIANILLRGRIGLVIAAAASLGLLYLTFFLSLSSPDATNHYVQAGGLGTLCFAAALVIQALV 
RRQEQTETLAEERAEWANLEELNALILQRMRTGILWDSRQAILLANQAALGLLRQDDVQGASLGRHSPMLMHCMKQWR 
LNPSLRPPTLKWPDGPTVQPSFISLNREDDQHVLIFLEDISQIAQQAQQMKLAGLGRLTAGIAHEIRNPLGAISHAAQL 
LQESEELDAPDRRLTQIIQDQSKRMNLVIENVLQLSRRRQAEPQQLDLKEWLQRFVDEYPGRLRNDSQLHLQLGAGDIQT 
RMDPHQLNQVLSNLVQNGLRYSAQAHGRGQVWLSLARDPESDLPVLEVIDDGPGVPADKLNNLFEPFFTTESKGTGLGLY 
LSRELCESNQARIDYRNREEGGGCFRITFAHPRKLS* 

>ORF30590c (SEQ ID NO:333)

LLQQLGGVADRAQRVADLMGDAGGQAMTGQLHLLRLLGDLRNVFEEDQHVLVVFAVEADKAGLHRRAIRHHLERRRTEA 
GIQAPLLHAVHQHRAVAAEAGALHVVLPEQAEGGLVGEEDGLTAIDHEDAGAHALQDQCVEFLQVGDRLGAFFGQRFGLL 
LAPHQSLDHQRGGEAQGAEAAGLDVWGGVRTAQAEEEGQVEQAEAGRRRDDQADAPAQQDVGNGHRHHQQAADAAGYAA 
TCVEQAAKHQHVGEREDEDRQQLPRRRQEQRDQDVEDQVAPTADMEQFRVDELEDLIFQFAGDQQDQYQADGQAVQWQT 
EDALPLLLAQP* 
>ORF29729 (SEQ ID NO:335)

TVPCRQLVLPGLQHPGRAVPAAVAAIAADLHPRAHRRADALRPVLRRWRRTQRHRQPAGGGGGHCQHPAARAHRPGHRGG
GQPRPALPDLLPQPEQSGRHQPLRPGRRPRHPVLRRRAGDPGSGAAPGADRNAGRRTRRDGRQPGGTQRI0PAAHAHRHP
RGR* 

>ORF30221 (SEQ ID NO:337) 

PSGHPPRQPGRPRPAQAGRRAGRQPRPPQPDADALHEAHAPESQPPSADAQGGAGWPDGATQLYQPQPRRRPARADLPRR 
HFADRPAGAADEAGRSWPPDRRHRP* 

>ORF30736c (SEQ ID NO:339) 

SHSFRSSCCGSAWRRRESCRTFSMTRFIRFDNSWMICVRRRSGASSSSDSCSSWAAWLIAPSGLRISWAMPAVRRPRPAS 
FICCACWAICEMSSRKISTCWSSSRLRLIKLGCTVGPSGTTLSVGGRRLGFRRHCFMQCISIGLWRPRLAPCTSSCLSRP 
RAAWLARRMA* 

>ORF30539 (SEQ ID NO:341) 

DPQPAGRDQPRRPTAAGVRGTGCPGPTPDADHPGPVEADEPGHRERPAALPSPPGRTAAARPEGVASAVRRRIPRQAAQR 
QPTAPAARCRRHPDPHGPTPVEPGAEQPGAERSSLQRPGARARPGLAEPRARPGERPAGAGSHRRRSRRTGGQTEQPVRT 
LLYYRKQRHRPGPLSLPRTLREQPGTDRLPQSRGRRRLLPHHLRPPAQTQLTEAARMSRQKALIVDDEPDIRELLEITLG 
RMKLDTRSARNVKEAASCWPASRSTCASPTCACRTAAASIWSSTSSSAIHRPRWP* 

>ORF31247c (SEQ ID NO:343) 

FPAVRGYPVHRRRSGLFVGSCVRLPSAEFARVGEGDAEAAAAFLAIAVVDPCLVALAEFAGEIEAQAGAFAFCSKEGFEQ 
WQmRYAGTVVDDFQHRQVALRVMEAQPDI^PVRLGAVAKmHQVAQKlVQLVV^GLDVAGTELQVQLAVVAQ 
PAGVFVDEPLKPLLQVELLRFGLAATGELQDVLDDQVHPLRLVLDDLRQASVRGIQFL* 

>ORF30963c (SEQ ID NO:345) 

LPAPAGRSPGRARGSARPGRARAPGRCSEDRSAPGCSAPGSTGVGPCGSGCRRHRAAGAVGCRCAACRGIRRRTAEATPS 
GRAAAVRPGGDGRAAGRSR* 

>ORF31539c (SEQ ID NO:347)

GGCHQLPQATEVDRFGEEVEGTGLERLDRGVQAAVRGDHGHRGLWMALLDVLDQIEAAAVRQAHVGEAQVERLAGQQLAA 
SLTLRALRVSSFMRPRVISSSSRISGSSSTIRAFCRLMRAASVS* 

>ORF31222 (SEQ ID NO:349) 

TGYPRTAGNHSRPHEAGHPQRPQRQGSRELLAREPFDLCLTDMRLPDGSGLDLVQYIQQRHPQTPVAMITAYGSLDTAIQ 
ALKAGAFDFLTKPVDLGRLRELVATALRLRNPEAEEAPVDNRLLGESPPMRALRNQIGKLARSQAPVYISGESGSGKELV 
ARLIHEQGPRIERPFVPVNCGAIPSELMESEFFGHKKGSFTGAIEDKQGLFQAASGGTLFLDEVADLPHAHQVKLLRAIQ 
EKAVRAVGGQQEVAVARAHPLRHPQGPRRRSRRRALPPGPLLPPQRHRAARTPLRERREDIPLLAERILKRLAGDTGLPA 
ARLTGDAQEKLKNYRFPGNVRELENMLERAYTLCEDDQIQPHDLRLADAPGASQEGAASLSEIDNLEDYLEDIERKLIMQ 
ALEETRWNRTAAAQRLGLTFRSMRYRLKKLGID* 

>ORF31266 (SEQ ID NO:351) 

SWTPAAPATSRKPRVAGPRAVRPVPHRHAPAGRQRPRSGPVHPAAPSTDPGGHDHRVRQPGHRDPGAQGRCLRLPHQTGR 
PRSLAGAGGNRPTLAQPGSRGSAGGQPPARRVAADARPAQPDRQAGAQPGAGLHQWRVRQRQGTGGAPDPRAGATYRAAV 
RAGELRRDSLRADGKRVLRPQERQLHWRYRRQAGPVPGRQRWHPVPRRSRRPADGHAGQTAPGDPGKGRARGRRPAGGRR 
RTCASSAPPTRTSPPKSAPGASARTSTTASTSSSCAYTAARTPRGHPAARRTHPQAPGRRHRPAGRQADRRRTGEAEELP 
LPGQRPRAGKHAGARLYPVRRRPDPASRPAPGRCAGCQPGRRREPERNRQPRGLPGRHRAQADHAGTRGOPLEPHRRGPA 
PGPDVPLDALPPEKAGHRLKVKRPVRRQAFWFSLLRGDQPGRRGPGR* 
>ORF31661c (SEQ ID NO:353)

TGAWLRASLPIWLRRARIGGDSPSRRLSTGASSASGLRKRRAVATSSRKRPRSTGLVRKSKAPALSAWIAVSRLPYAVIM 
ATGVCGWRCWMYWTRSRPLPSGRRMSVRHRSNGSRASNSRLP*

>ORF32061c (SEQ ID NO:355) 

RSWRKRPAPTSAARSLNVAQRMRTCDGDLLLAADRAHGLFLDRPEQFDLHGHRQVGDFVEEQGATAGGLEQALLVFDSAS 
mFLVAEEIAFHQLGGNmVHRHERPLDTWPLLVDQAPJlQFIJ^GIATDVDRRLAARQLADLVAQGMRRRLAEQAV 
VHRRFLGFRVAQA*

>ORF32072c (SEQ ID NO:357) 

GGSRGPGGSARRRLRRRGPCGWRGCARATATSCWPPTARTAFSWIARSSLTCMAIGRSATSSRNRVPPLAAWNRPCLSS 
IAPVKLPFLWPKNSLSISSEGIAPQFTGTNGRSIRGPCSWIRRATSSLPLPDSPLM* 

>ORF31784 (SEQ ID NO:359) 

WKASSSATRKAASLALSKTSRACSRPPAVAPCSSTKSPTCRWPCRSNCSGRSRKRPCARSAASRRSPSHVRILCATHKDL 
AAEVGAGRFRQDLYYRLNVIELRVHRCANAARTSRCSPNASSSAWPATPACRPPG* 

>ORF32568c (SEQ ID NO:361) 

GAKTKRPVFGQAFSLSVDAQLFQAVAHRAERQAQALGRGGAVPAGLLECLHDQLALDVFQWLEWDFAQARGAFLAGTR 
RIGQAQWRLDLWFAQGIGALQHVFQLADVAREAWLQLLLCVAGQPGGRQAGVAGQALEDAFGEQRDVLAAFAQRCTR 
SSMTLRR* 

>ORF33157c (SEQ ID NO:363) 

TDGGARLVARRRSSGRRGDLAWRDPWRQGAAGGRRLERRWEAAWPGTARGTGERSDDPLQVRGGFPAAHGAGQGALRDS 
AARRPHPDRQHLGTFGLRQDADRRGAGKPQGVCGRTVAGTGGHAAGGPLGRVAPGLSRRHPLYRSGAWLRRALAEYRALP 
QRAGPGTGWPSAGGSHERAGTDHRPGPLRPGWSPLRSENQKACLRTGLFTFSRCPAFSGGSASSGTSGPGAGPRRCGSS 
GSPRVPA* 

>ORF32530 (SEQ ID NO:365) 

KGLSEDRPFGFRSSEATSRGVGGRVDDRFPPAHEIRQQTARRRCQDQPVAWPGIQPEPVEARHRTDIGDAFGRARAQPC 
PVGHRLHVRQFRQQFCRRRPEAFQRLVGRRLVEARMFQGAADQDVAVAPRNRVAPLGQHHARQEIRRALVEDHLTFHRYH 
GQFQAKRLQQLAAPGACRQQHLVATDLATRGRHADHSIAVAQPAAHLRLFMQLEIGELLQGCPQ* 

>ORF33705c (SEQ ID NO:367) 

VIFLCSWQIGRSPWSRDVVWGAGVIGLLTARELALAGLRVTLVERGESGREASWAGGGIVSPLYPWRYSPAVTALAHW 
SQDFYPALGQRLLDETGLDPEVHTVGLYWLDLDDQTEALQWARNHTRPLKEVPIEEAYAAVPGLGAGFQRAVYMSGVANV 
RNPRLARSLRASLQQFANLELHEQTEVRGWLRDGDRWGVATSRGEIRGDKVLLAAGAWSGELLKPLGLELPWPVKGQM 
ILYKCAADFLPRMVLAKGRYAIPRRDGHILIGSTLEHSGFDKTPTDEALESLRASAAELLPELADMQPVAHHAGLRPGSP 
EGIPYIGPVPGFDGLWLNTGHYRNGLVLAPASCRLLADLMSGREPIIDPAPYAPAGRL* 

>ORF32832 (SEQ ID NO:369) 

GFPAPRRSASCRSPNVPRCCRSGCGRRAAESRSAPWPAPCAAGNPPRTCRGSSDLSPVPRAVPGQAASTTRRSRRLPPAA 
PCRHGSRHARSPRRPLDRRRATSRAPPSVHATRDWRIVAGMPAMSAPGEDCAHWPRPTCRPPAGSLRPARAPPRRPPRSA 
LPSTAGCGCVPTAVPRSGRPGPASTGQRYGPRDRARSRRANAAPGPGRSPATSAPGRSPPGCSATDRAARRSRLPPRMPH 
AHSRPAPPGSPAVRRAPAPGRSTGR* 
>ORF33547c (SEQ ID NO:371)

GILGGRRDRLAALSVALQPGGDRPGALVAGLLPGPGAAFARRDRARSRGPYRWPVLAGPGRPDRGTAVGTQPHPAVEGSA 
DRGGLRGGARAGRRLPAGGLHVGRGQCAQSSPGALIAGIPATIRQSRVA* 

>ORF33205 (SEQ ID NO:373)

ARQARIAHIGHARHVDRPLEACAQPGHRRVGLLDRHFLQRPGWACPLQCLGLWQVQPVQANGMDLGIEPGLVEQTLPQ
GRVEVLRPVRQGGHRRAVAPRIERRDDPASRPGCLTPTLAPLHQGHPQSGERQLPGGQQADDASAYHYYISTHHRAPTDL 
PGTEKYHSKGSDADELPASTNSVESSPGEKPIIPAEVFIP* 

>ORF33512 (SEQ ID NO:375) 

SGETIPPPAQDASRPLSPRSTRVTRSPASASSRAVNRPMTPAPTTTTSLLTTGLLPICQEQRNITQRDQMLTNCLLQRTQ 
SNLVPVKSPSYPQRYSSHEIEWFEPGGTIDSPIDPCDRRDNCAAHPPRQNEAGH* 

>ORF33771 (SEQ ID NO:377) 

KAHHTRRGIHPMSSGLNLVELSIVLSILAIGVTIALPTLPDRMRIIISRDIGDSLTSHVMAARASSIQNGVIIEVCGSG 
DGSTCSEEWHLGWFSRNDRSQQILARHENTSRTDIHWRGFDKRLRYLPNGTSPTGNGRFFECKDDRIEWQLVLNRQGRLR 
VAGKSENKKLSYLCSRR* 

>ORF34385c (SEQ ID NO:379) 

WRAHCPCSRDSRSGEWDDWQTVCETVLSPPGAQVRELFIFALSRHPEAALPIEHQLPLDAIVLTFEETPVTCRAGAIRQV 
AQSLVEAPPMIGATRIFMPGQYLLAPVVTAEPAEHPFLAAGTAnATAHLDDHAVLYAASPRSHHMTSQAVTNITANVP 
LHSVGEGGQRNCHAYRKDR* 

>ORF33988 (SEQ ID NO:381) 

SSRCAVAVTAVPAAMGISAGSAVTTGANRYWPGMKIRVAPIFIGGASTSDCATCLMAPALQVTGVSSNVRTIASSGNWC 
SIGKAASGWRERAKIKSSLTCAPGGERTVSHTVCQSSHSPLRLSLLQGQCALH* 

>ORF34274 (SEQ ID NO:383) 

KALLPVLQAVRELFHIPFASHPTLRSGCLCYRDNALSTRQDYLALFLVEYCMRSICRSAGFSLIELMMVLVLVAIFASIA 
VPSFNALIEmiQTASEELYSLLQYARSEAVNRHANVSIRATQNNDWAKGLEIISGATTVQKHQGFQQVSLSASSATAE 
LTFNATGTLSNQAANIDIKVCFAGDKSTGRLLTVQPSGRVILYPSSKQPDSCN* 

>ORF34726c (SEQ ID NO:385) 

RDLLETLVLLHGGRAADDFQAFCPVIVLRRPDAHIGMTVYSFAASVLKQAVEFLAGSLDSVALDQGVETGYGNAGEYGDQ 
NQHHHQLDQGKAGAATNRAHAVLHKEKGQIILPSGERIVPVAETAGAESGMTGKRYVKQFSHRLEHR* 

>ORF34916 (SEQ ID NO:387) 

GKPHSRETGFSMIEVLVALVLISIGVLGMVAHQGRTIQYTQESVQRNAAAHLASDLMEIMRADPDAVLNLRAQLREDSVY 
YKAKGSDFPAAPARCAPLPADAKERLGCWAQQASKDLPGASALLNSQFYICRSPTPGTCDNTKGSAIEIQVAWRAMDGAC 
FNASDSTLCTYSVRSEL* 

>ORF35464c (SEQ ID NO:389) 

RACLFSQFGADAVGAQGGVRGVETRSIHGSPGNLDFDGRAFGWAGTRGWAATNVELAIQECGGSRQVFRGLLGPAAETF 
LSICWQWRAARWGCGKVAALGLWDRVFA* 

>ORF35289 (SEQ ID NO:391) 

IANSTFVAAQPRVPATTPKARPSKSRLPGEPWMERVSTPLTPPCAPTASAPNCENKHALQQNAERPIDGRTARGTRYKQL 
PDPGDQPDLHRQQTQLSFPARPGRQPGK* 
>ORF35410 (SEQ ID NO:393)

LHLVHLQRPLRIVRTSMLFSKMQKGLSMVELLVALAISSFLILGISQIYIDNKRNYLFQQGQAGNQENSRFVLMLLQQQL
DKTAraRLHDDNMENAFKSATFNGCRAFVAGETIAAATALKAGEYGVCLRyQPAYKGEHDCLGNEITGVPEKPFTNTPPV 
WRLVYLPSAGTLSCSRPDIAQSKSGELVSGLTDFRLEAGVGPADRSERKVSSFVALQDVAGRPIRALRFSILAGSDNTS 
LRTGDDSQARDRWIVLYPESKSAIEAADKGQIYQIARGNQTIRNLMP* 

>ORF35907c (SEQ ID NO:395) 

VDQANDDRGSICEGLFRNSGNFITEAIHLPFVGGLIAQADTVLTRLEGSCRSDSLASHKCTrAIECRGFESILHVVV^ 
TIGCLIQLLLQQHKNEAAIFLVAGLALLEKIVAFWDVDLADPQDQEAAYSECHEQFYHR* 

>ORF35534 (SEQ ID NO:397) 

SWGSARSTSTTNATIFSSKARPATRKIAASFLCCCSNNWIRQPIVAFTTTTWRMLSNPRHSMAVVHLWLARLSLRQLPSR 
RVSTVSACAINPPTKGSMIASVMKLPEFRKSPSQILPLSSFAWSTYRAPVP* 

>ORF35930 (SEQ ID NO:399) 

VAWPISPSRNRENWSWSQTSAWKRGSGQQIVANAKYPASSHYRMSPWLSEHCASQSWQAATIQACAQEMIARHAIAG 
SSFIPRAKAPSRPQTKARFTK* 

>ORF36246 (SEQ ID NO:401)

PNHQESHAMTLRHTSRQQGSTLLISLVILLMITLLAVSNMREVSLESRITGNLIEQKRLRNAGEAGLREGERRFFNTIKP 

PEVGS^SNmPCILNLSALSVPRDDVHNNPVAAM 

EQNNEAENPEYGNMMRGVGTFYYETNSRALNKAGGETVLQAVHARLYTN* 

>ORF26640c (SEQ ID NO:403) 

GIQTRELALSVLPFRAATGLLCTSSRGTERALRFSMQGRLTLLSAHPLPTSGGLMVLKKRRSPSRSPASPALRRRFCSMR 
LPVIRLS SDTSRMLETARSVI INKITSEINNVDPCCREVWRRVMA* 

>ORF36769 (SEQ ID NO:405) 

CAGSARSTTKPTAAPSTRRAERLFYRPFMHACIPTDWSQRMIHQITRAGKSLLAAGCTLSILFASDSYAATALNVSQQPL 
FLTQGVAPNLLFTLDDSGSMAWAYVPDGISGNSGRAGRSSDYNALYYNPDYAYQVPKKLTLSGDQIIVSDYPVPRFTAAW 
QDGYAQGSTTNLSNNYRPQWGTGWLGCIDSSCNTGRAYYYTYKVSASCPAQPVSSSNSCYTYNALPTSQESNFAIWYSYY 
RNRILATKTAANLAFYSLPENVRLTWGALNTCSIGMSRSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDRAG 
RFLQTNGTAYTTEDGmSCRASYHIMMTDGIWNGRNVTPGNLDNQNQTFPDSTLYRPQPPYADSNASSLADLAFKYWTT 
DLRPSIDNDLKPFHAYKSGDDSKDYWDPRMPATWQHHVNFTVGLGLSYSLTLNSAPTWTGSTFGNYEELHAGSKAWPSV 
DNDAAPGNVYDLWHAAINSRGDFFSAESPDSLVQAFNKILTRISERNTSSSKPAMTSALQDDGTGDKLIRYSYQSSFASD 
KNWAGDLIRYKVESTSTGSTKTQEWSAGALLDNRAPATRNIYIAS^ 

TKGAQRVDFIRGQQNMDGFRQRQAVLGDIVHSSPAWGPAQYLTYLANPIEPSGDYGTFKTEADQRSPRVYVGSNDGMLH 
GFNIKTGVEEFAFIPTAVFEKLNKLTGISYQGGAHQYFVDATPWSDAFFDGAWHTVLIGTLGAGGRGLFALDVTKPDDV 
KLLWEYDSSTDSDLGYTFSKPTVARLHSGQWAWTGNGYGSDNDKAALLLIDLKKGTLIKKLEVQSERGIANGLSTPRLA 
DNNSDGIADYAYAGDLQGNIWRFDLIGNTRNDDPDTNTSINPFKPGDVDPSAFRVSFSGAPLFRARADNNTRQPITAPPT 
LVRHPSRKGYIVIVGTGKYFEDDDAQADTSRAMTLYGIWDRQTKGESANSTPTIDRNALTAQTMTTEANSTFGSVNRNIR 
LISQNPVKWYKDGATGTMSDVASYGWRLNLEWSSKKGEMMIEDMFAAGQVLLLQTLTPNDDPCDSGSTSWTYGLNPYT 
GGRTSFTVFDLKRAGIVDSGSDYNGSWSAFQQDGLGGLAITQNEQRQSEACTGDECIIFNPSDKSNGRQTWRWEEK* 

>ORF37932c (SEQ ID NO:407) 

AGIAVGIRGLWPIEGAIRKGLVLWEIAGGDVPTVPDTVGHHDVIAGPAGICLS7FGGISCAVGLQEASGSVKSRMQRST 
AAGRAVRQPIEEIDFVLLVELEQGIVLAASAVGADATGVQGPPSETHVFRQAVKGQVGSGLCGQDAVAIVGVPYRKVAFL 
TGRKSIVGITGVGAAHRLCRAASAYLISIVISSPGIAAAIDATKPAGSPLRAIVIAQVGGGALGVAILPGCCEAWHWIVG 
NDDLIA* 

Fig. 4-19 




>ORF38640c (SEQ ID NO:409) 

LTSLNIAPCERLKAVGSAIAGDVNITGSGSSWQQCAGAPPLGPGRTGGSRLHLVTYKVARPVLVTGKAGLVAVADQLVA
GSVILQRGSHCWFGGGGVALGNPCQDLIESLNQRVR* 

>ORF39309c (SEQ ID NO:411)

SCLVIIASIAVAGNYCPLSAVQSGYGRFGEGVTKVRVGTAIVFPKQLDIVRLGYIECEQAATSSTKRSDQNSVPSSIEKG 
IADDRCSVDEILVGTALVADAGKLVKLFEYCCRDESELFHAGFDVETMQHAIVGSNINSRAALVCLCLECAWAAGFDGV 
GQISEVLGRSDHGWRRVHDVP* 

>ORF38768 (SEQ ID NO:413) 

GTSCTRLQPWSDRPFTSLIWPTPSNPAATTAHSRQRQTSAALEFMLDPTMACCMVSTSKPAWKSSLSSLQQYSKSLTSLP 
ASATRAVPTNISSTLHRSSAMPFSMELGTLF* 

>ORF40047c (SEQ ID NO:415) 

KHLAGSEHIFDHHFAFLAAIDLQIQSPAIASHIRVRGTGCSVFVPLHRVLANKPNIPVHATECGVRLCCHGLGCEGVAVD 
GWGTVCAFALGLAIPDTIERHGSAGIGLSVIVLEVFSCTYDDDVALTARMAYQGRRSRDGLTSIVVGASTEKRGAAERYS 
ESRRIYISGLEGIDRGICVWVWAGIADQIEAPDISLQIASIGWSNAIAWISQARRR* 

>ORF40560c (SEQ ID NO:417) 

PASPWGHGWASDMSAHRLLPDRRQDEHPRPRPHSRTYSWHSRQRLGKPELELQELWREDVRGSCSFLLDDPPGLSSVTL 
VAGVEDDALITSASLGLTLFVLGNGQAT* 

>ORF40238 (SEQ ID NO:419) 

VAWPLPRTNSVNPRLALVMSASSSTPATRVTDDKPGGSSRRNEHEPLTSSRHSSCSSSSGLPNLCLECHEYVRECGRGRG 
CSSCRRSGSSRWADISLAQPCPTTGLAGHILGTSGTDSVFLRQTHQRPARNRWLHYQAGPSRSLRIGAATMKSNRGFTL 
IELMIVVVIIAILAGIAYPSYDEYVKRGNRTEGQALLSEAAATQERYFSQl^YITTQADIGKLHMRiniSGTTVKSSTGK 
YSLTVDTVANDGGYRLIANQAFNDLDCGNLTLTANGEKGRTGSKKSVAECWR* 

>ORF40329 (SEQ ID NO:421) 

RTTNLAGRRGEMNMTCLRLLATALAALAIACPTFALSAOT 

LVRQGQTVSFSGKLTSDLPEIESFYIIKQAPLVPFGSEQQQ* 
>ORF40709c (SEQ ID NO:423)

SLCSTSLLLLRSEGNERGLLDNVERLDFWQVAGEFAGERHCLSLTYQE7DRRVLLLDTVGQAICLPIDYYQTGGRMILD
HAHILERIRGTQGKGWASQS* 

>ORF40507 (SEQ ID NO:425) 

SMGRHIACPTVSNNRTRRSYSWYVRDRQCLSPANSPATCQKSSRSTLSSRPLSFPSDRSSNNEVEQRLHSHRVDDRRSNH 
RYSCWyRLPQLRRIREARESHRRTGITQRSSRYSRALFFTEQYLYHYPSRHRQAAYAQHIGHHSEVLHRQIQPYRRYGSQ 
RRRLSPYR* 

>ORF41275c (SEQ ID NO:427) 

VGGVGRAGTGGTGGEHYLLDAQGTAGQGFVIGIFAHLRKPGLCRALPFSCLGALAPAFCNALLASSPAFLAVGGQGQVAT 
IKIVECLVSDKAITSWGYRIDGKAVFACGGLHCGARCVAHMQLADVGLGSDISIVL* 

>ORF42234c (SEQ ID NO:429) 

STSSRPEPSVMPFPSGEGGSKVHSSMRVAHETAMQIKLMPRGFCAGVDRAIEIVNRALDVFGPPIYVPnHEVVHNKFV 
VDNLRQRGAIF7EELDQVPDNVIVIFSAHGVSQAVRKEAEGRGLKVFDATCPLVTKVHMEVVRYSRDGHECVLIGHEGHP 
F^GTMGQYDASNGGAIYLVEDEADVAALEVRKPEALHYVTQTTLSMDDTSKVIDALRMFPQIQGPRKITOICYATQNRQ 
DAVKELADQCDmVVGSPNSSNSNRLRELAERMGTPAYLIDGAEDMQRGWFDGVRRIGITAGASAPEVLVRGVIAQLRE 
WGASEEQELEGREENITFSMPKELRVKAL* 
>ORF41764c (SEQ ID NO:431)

RPPRGGRHHGPVRCQQRRCHLPGGGRGRRRRAGGAQARSPALRDPDHPVDGRHLEGHRCPARQVPADPGAAQERHLLCHP 
EPPGCREGTGRPVRHGPGGGQPQQFQLQPPARTRRAHGHAGLPDRRRRGHATRLVRRCASHRNHRRRLRAGSAGARSDRP 
AT* 

>ORF41284 (SEQ ID NO:433) 

LGDHSAHQHFRRGGACGDSDATHTVEPAALHVLGAVDQVGRRAHALGEFAQAVGVGTVGAAifflQDHVALVGQFLHGIlAV 
LGGIADWLARPLDLRELGAQGIDDLRGWHRQGGLGHWQGFGLAHLQRGDVGLVLHQVDGTAVAGIVLAHGAFHLGVA 
FMPDQHAFVAVAAVAHHFHVHLGHQRAGRVENLQAAPLGFLADRLGNAVGAEDDDDVVRHLIEFLDEDGAALAQVVHDEL 
VVHHLVTHVDRRAEDIEGTVDDLDGAIHAGAEAAGIGEFDLHGGLVGDAVIGRMNLATALPAWEGRSDRRFRPAGRR 

>ORF41598 (SEQ ID NO:435) 

PSRCRPSTGWSGSRSAGLRACAPPARRRRPRPPPGRWHRRCWHRTGPWCLPPRGGLHARSARIRGRRGCSAPLPCAPWSP 
AGRSRRKPSGRAPRLPCGPPGKRRGR* 

>ORF42172c (SEQ ID NO:437)

QGSFVQSPRRPRDRHANQTRQSPRLLRRRGSRHRDRQPCPRCLRPADLRASRGGAQQVRRGQPAPARRHLRRGTRSGAGQ 
RHRHLQRPRRFPGGPQGSRGARPEGFRRDLPAGDQGAHGSGALQPRRPRMRADRA* 

>ORF42233c (SEQ ID NO:152) 

RRPAGLiniRSLRPSQAGRAVARFIRPITASPTRPPCKSNSPIPMSAPAWIAPSRSSTVPSMSSARRSTCVTRWCTTSSS 
WTTCASAAPSSSRNSIRCRTTSSSSSAPTAFPRRSARKPRGAA* 



Fig. 4-22 
33A9 (SEQ ID NO:102)

CAAAGCATAAGACCAAGATGGCACATTGCCAACAAAATACCCTTCCCCGCTACCGTTGTTTTATCGTTGTTGCCAGCCCT

GATCTGGCGGAAAAGCCCGCTCCATGAATCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCC 

GTCCCCCACCCCAACAACCAAAGCTGCCCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTC 

KCGCCGGCAGCTACTGGA(3AACATCTGGCAGCGC(X:CTCGCTATCCAAGCA(K:AATTCGAGGAGATCTACCGGCGG 

CTGGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGA 

TCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCAC 

AGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGAC 

CTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGT 

GAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCT 

GGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGC 
GAGATCATCGTGAA(3GCAGACCA(MCCTCAGTTGCACAGGAGCTAGGAG(X:AATCCGGATCGAGCTCT(MCTGCACCG 

CGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTG 
GCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAA 
TGCCGAGGAC^GGCCATTTGGACGGCCACGGTAGACMCGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTG 
CTCCAGCCTTGATCTG(^GAT(X:TGCC(3AGCGCOT 

TCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGAT 

GCTC(^TCAACCTGCGCCGAGCGTTGCGmCCGG(mCGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCAAG 

AAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGCCG 

GCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCTCC 

TGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCA 

TCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCA 

GGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCT 

GGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTG 

GTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCT^^ 

CCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGG^ 

CACGATCTCCGGGAAGCCAGC 



Fig. 5 
33A9 SEQ ID NO:103

1 MNRHGASHVS TPFLDIQEAV PHPNNQSCPR GIHPSSEQHG TARHASPPAA 
51 TGEHLAARLA IQAAIRGDLP AATGQLCGVG PAxPCFGKSS PCPSRRDDRS 
101 RPGDRGYALK VRQTYLLPIG AAPESQSAQA EAWSAAAAYG ALAHDIGKIV 
151 VDLQVELQDG STWHPWNGPI NQPYRFKYVK SREYQLHGAA SALFIHQLLP 
201 RTALDWLSRF PELWAQLIYL FAGQYEHAGI LGEIIVKADQ ASVAQELGGN 
251 PDRALAAPKQ SLQRQLADGL RFLVKDKFKL NQPSGPSDGW LTQDALWLVS 
301 KPAADQLRAY LLAQGIDGVP SSNAPPFSML QDQAVIQTNA EDKAIWTATV 
351 DNGAGWRNKF TLLKIAPALI WTDAAERPSP YSGSLWEDG TASTEKPETT 
401 CEIPNGPAEQ QQAPETKMML HQPAPSVAKP ANETQAIAKP STDDQEETDD 
451 LYALLGNINS PLEELDTSHD SPAASPTNTR GEENLQQPLG TKEPTDCAPE 
501 AIEDVFMPSR STDLGQGFVG WMKSGIAARR LFINDTKALV HTVDGTAMLV 
551 TPGIFKRYVQ EHPVLEKLAQ AKETTGWKLV QRAFEKQGLH RKTSKNLNIW 
601 TIKVSGPRKT KELKAYLLQD PKLLFPEQPL DNPSLTVITD AE* 



Fig. 6A 
33A9-ORF1 SEQ ID NO:189

ATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCCCCCACCCCAACAACCAAAGCTGCCCCAGG
GGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCTGGCAGC 
GCGCCTCGCTATCCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAGCAGCTC 
CCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAA 
GGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCG 
CGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGG 
CACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGC 
CTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAAT 
TGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTT
GCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCT 
TCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCT 
GGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAAC 
GCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGT 
AGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGC 
GCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATT 
CCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACC 
GGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTA 
ATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAG 
AACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAG 
AAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCA 
AGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCG 
GTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCA 
TCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACC 
TGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGA 
GGTGTGGAATGA 



Fig. 6B 
33A9--ORF2 SEQ ID NO:190

ATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCCCCCACCCCAACAACCAAAGCTGCCCCAGGGGGATTCATCCT
TCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCTGGCAGCGCGCCTCGCTAT 
CCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAA 
AATCATCACCATGCCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGAC 
CTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCG 
CCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAAC 
GGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCT 
CATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGT 
TCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTA 
GGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGT 
GAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCA 
AGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTC 
AGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGC 
TGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCT 
ACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCG 
GCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGAC 
GCAGGCGATTGCGAAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGC 
CACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAG 
CCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCT 
GGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGC 
ATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAA 
CTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAG 
TAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATC 
CCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 



Fig. 6C 






ORF3--33A9 SEQ ID NO:191

ATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCTGGCAGCGCGCCTCGCTATCCAAGCAGCAATT
CGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATG 
CCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCG 
ATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGA 
CATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACC 
AGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTG 
CTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTA 
CGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGG 
ATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTC 
AAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCTGCCGA 
TCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGG 
ACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGGATGGAGAAAC 
AAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACT 
GGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGC 
AAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCG 
AAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCT 
AGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCA 
AGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTC 
GTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGG 
GACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCA 
AGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAAC 
ATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTT 
CCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 



Fig. 6D 
ORF4--33A9 SEQ ID NO:192

ATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCC
GGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCG 
TCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTC 
AAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGC 
ACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGA 
TCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCT 
GCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACC 
TAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCT 
ACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATC 
CAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACT 
CAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATG 
GAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACG 
AAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGA 
TGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACG 
ACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGAT 
TGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAA 
ATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGG 
TCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGC 
TGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAA 
GGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTC 
TGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6E 

ORF5--33A9 SEQ ID NO:193

ATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGG 
ATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACA 
GCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCT 
GAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCA 
GGCGATTGCGAAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCAC 
TAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCA 
CTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGG 
ACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATA 
CCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTG 
GCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAA 
AAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCA 
AATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 



Fig. 6F 
ORF6-33A9 SEQ ID NO:194

ATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGA
TCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACT 
CGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGC 
GCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATC 
TGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCA 
CGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGG 
AAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGT 
TTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGG 
ACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6G 

ORF7--33A9 SEQ ID NO:195 

ATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCA 
AGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGC 
CGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCT 
CCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGG 
CATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGC 
CAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAG 
CTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTC 
TGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACA 
ACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6H 

ORF8--33A9 SEQ ID NO:196 

ATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCAT 
CAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCC 
AAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAA 
CAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCT 
CAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCG 
ATGCCGAAGGAGGTGTGGAATGA 



Fig. 6I
ORF9--33A9 SEQ ID NO:197

ATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCAT
GCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGA 
CCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACC 
ATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCA 
GCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6J 

ORF10-33A9 SEQ ID NO:198 

ATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGAC 
GACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGA 
CCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAG 
CAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6K 

ORF1-33A9 SEQ ID NO:199

MEPPMFQLLSWISRKPSPTPTTKAAPGGFILPLSSMELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQQL 
PASENHHHAHPGGMIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIVVDLQVELQDGSTW 
HPWNGPINQPYRFKYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASV 
AQELGGNPDRALAAPKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSN 
APFFSMLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLVVEDGTASTEKPETTCEI 
PNGPAEQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEE 
NLQQPLGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHP 
VLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEG 
GVE. 

Fig. 6L 



ORF2-33A9 SEQ ID NO:200 

MFQLLSWISRKPSPTPTTKAAPGGFILPLSSMELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQQLPASE 
NHHHAHPGGMIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIVVDLQVELQDGSTWHPWN 
GPINQPYRFKYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASVAQEL 
GGNPDRALAAPKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSNAPFF 
SMLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLVVEDGTASTEKPETTCEIPNGP 
AEQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQ 
PLGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEK 
LAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6M 
ORF3-33A9 SEQ ID NO:201 

MELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQQLPASENHHHAHPGGMIDHGLEIVAYALKVRQTYLLP 
IGAAPESQSAQAEAWSAAAAYGALAHDIGKIVVDLQVELQDGSTWHPWNGPINQPYRFKYVKSREYQLHGAASALLIHQL 
LPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASVAQELGGNPDRALAAPKQSLQRQLADGLRFLVKDKF 
KLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSNAPFFSMLQDQAVIQTNAEDKAIWTATVDNGAGWRN 
KFTLLKIAPALIWTDAAERPSPYSGSLVVEDGTASTEKPETTCEIPNGPAEQQQAPETKMMLHQPAPSVAKPANETQAIA 
KPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTDCAPEAIEDVFMPSRSTDLGQGF 
VGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLN 
IWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6N 

ORF4-33A9 SEQ ID NO:202 

MIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIVVDLQVELQDGSTWHPWNGPINQPYRF 
KYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASVAQELGGNPDRALA 
APKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSNAPFFSMLQDQAVI 
QTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLVVEDGTASTEKPETTCEIPNGPAEQQQAPET 
KMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTD 
CAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTG 
WKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6O 

ORF5-33A9 SEQ ID NO:203 

MLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLVVEDGTASTEKPETTCEIPNGPA 
EQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQP 
LGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKL 
AQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 



Fig. 6P 
ORF6-33A9 SEQ ID NO:204 

MMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTDC 
APEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGW 
KLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6Q 

ORF7-33A9 SEQ ID NO:205 

MLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTDCA 
PEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWK 
LVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6R 

ORF8-33A9 SEQ ID NO:206 

MPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEK 
QGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6S 

ORF9-33A9 SEQ ID NO:207 

MKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWT 
IKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6T 

ORF10-33A9 SEQ ID NO:208 

MLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPE 
QPLDNPSLTVITDAEGGVE. 

Fig. 6U 



[Fig. 7A: graphic not recoverable; only rotated axis tick labels survive in the scan] 

Sequence: 34B12 EcoRI fragment From: 1 To: 4590 

10 20 30 40 50 60 

GAATTCCATG GCGCCGTGGA GGAGGCTTCC GAGTCGCCGG TGGCAGGCGT ACGGGCCGGC 60 
AACTACCAGG TCGACCTGGA CGATGCGAGC TTTGCCCGCC AGGTAGAACG CCTGCAGGCC 120 
CACGTGAGGG CCGGCGACGT GTTCCAGATC GTACCTTCGC GCAGCTTCAG CATGCCGTGC 180 
GCGGACCCCT GGCGGGCCTA TCGCCAGTTG TGCCTGCGCA ACCCCAGCCC GTACCGCTTC 240 
TTCCTCGATG CGGGGGACTT CTGCCTGTTC GGCGCTTCGC CGGAGTCGGC ATTGAAGTAC 300 

310 320 330 340 350 360 

GACGCGGAGA GTCGCGAGGT GGAACTCTAT CCCATTGCCG GCACCCGCCC GCGCGGATGC 360 
GATGCCCGGG GCGCCATCGA TGCGGAACTG GACAATCGCC TGGAAGCGGA GTTGCGCCTG 420 
GATGCCAAGG AGATCGCCGA GCACATGATG CTGGTCGACC TGGCGCGCAA CGATCTGGCG 480 
CGCGTCTGCC GCAGCGGTAC CCGGCAGGTG CGCGACATGC TCAAGGTCGA TCGCTACAGC 540 
CACGTGATGC ACCTGGTCTC GCGCGTGGCT GGCGAACTGC ACGGCGAACT GGATGCGCTG 600 

610 620 630 640 650 660 

CATGCCTACC GTGCCTGCCT GAACATGGGC ACCCTGGTCG GCGCGCCGAA GGTCCGTGCC 660 
ATGCAGTTGC TGCGGCAGTA CGAGGATGGC TATCGCGGCA GCTACGGTGG TGCGATCGGC 720 
ATTCTCGACA GCGCCGGCAA CCTCGATACC AGCATTGTCA TCCGCTCCGC CGAGGTCCGC 780 
GAAGGTATCG CGCGGGTTCG GGCAGGCGCC GGCGTGGTGC TGGATTCGGA TCCACGGCTG 840 
GAGGCCGAGG AAACCCGCAA CAAGGCGCTG GCGGTGCTGA CCGCCGTGGC CGCTGCCGAA 900 

910 920 930 940 950 960 

CGCGAAAGGG GAGAGCGCGA TGCGCATCAC GCTGTTGGAT AACTTCGATT CCTTCACCTA 960 
CAACCTGGTC GAGCAGTTCT GCCTGCTCGG CGCGGAGGTC CGGGTGATGC GCAACGATAC 1020 
GCCGTTGCCG ACGATCCAGG CGGCATTGCT GGCCGACGGT TGCGAACTGC TGGTGCTGTC 1080 
GCCGGGGCCC GGTCGGCCGG AAGACGCCGG CTGTATGCTG GAATTGCTCG CCTGGGCCCG 1140 
CGGGCGCTTG CCGGTGCTCG GCGTCTGCCT CGGCCACCAG GCGCTGGCGC TGGCCGCCGG 1200 
1210 1220 1230 1240 1250 1260 

TGGCGCGGTG GGCGAGGCGA GGAAGCCGCT GCATGGCAAG AGCACGTCCC TGCGTTTCGA 1260 
TCAGCGTCAC CCGCTGTTCG ACGGCATCGC TGACCTGCGC GTCGCGCGCT ACCACTCGCT 1320 
GGTGGTCAGT CGCCTGCCGG AAGGTTTCGA CTGCCTGGCC GATGCCGATG GCGAGATCAT 1380 
GGCGATGGCC GATCCGCGCA ATCGACAGCT GGGCTTGCAA TTCCATCCCG AGTCGATTCT 1440 
CACCACCCAC GGCCAGCGTC TGCTGGAGAA CGCTCTACTC TGGTGCGGCG CGTTGGCGGT 1500 



Fig. 7B 
Sequence: 34B12 EcoRI fragment From: 1 To: 4590 

1510 1520 1530 1540 1550 1560 

CGCGGAGCGC CTTCGGGCCT GAGCGGCGCT GCGCAGTTTC GACCGAGGCT CGGTTGCCAG 1560 
GCCGGCGCAT CGTCGAAACG CTGGCGGCCC AGTTCGCGCA GGCGCTGGCG GGCGCTTTCG 1620 
AGAAAGCGAC GGAAGCTGCG CTCGGATTCC AGCGCGGTGT TGTAGTAGCA ATACACCTTG 1680 
GTGTCGATGC CGCCCGGTTC GTACAGTTCG CTGAGGACTG CCAGGGTACC GTTGCGCAGG 1740 
CGTTCCTCGA CGAAATAATG CGGCGAGATG CCCCATCCGA CGCCGGCTTC CACCAGACGC 1800 
1810 1820 1830 1840 1850 1860 

AGCATGTCGT CGAAGTTTTC CACGAAGAGC ACCTTGTCGC TGACCGGCCG CAGCAGGTTC 1860 
GAATGCTGCC CGGAGCGGCT GCCGAGGCTG ATCTGCCGGT AATTGGCCAG GCTCGCGATG 1920 
CTGTGCAGGG AGGCATTGCA CAACGGGTGC TGCGGATGGG CGACGACGAA CGCCTTGGTG 1980 
TAGCCGAGCA CGCACTGGTT GAAGCGGGAG ATCTTCAGTT CCTCGTCGAT GGTGATGGCG 2040 
ATATCGATTT CCGCGTTGTC CTGCTTGATC GTCGCCAGGC TATCGGCGGG CGAGGTGCGT 2100 

2110 2120 2130 2140 2150 2160 

ATCAGGCTGA CCATGTTGAA ATCGTCGAGC AGTACGCTGC TCACCGTATC GCAGAACGAC 2160 
GGCGGGATGG CGGTGTCCAG CAGCACCCGG AGATTGCGCG GACCCTTGTT GAGATTGAAG 2220 
GCGATGTCGC CGATCAGCTG CTGGTAGTTC AGCAGGCTGC GCATGTAAGG GATCAGGCGA 2280 
AGCGCCTGCT CGGTGGGTTC GACCTTGTAG CCGTCCCGAC GGACCAGCTC CACGCACAGG 2340 
TCGATTTCCA GGTTGCTGAC CGCCGAGCTG ACCGCGGTGT GCGACTTGCG CAGGATCCGC 2400 

2410 2420 2430 2440 2450 2460 

GCAGCGGAGG AAATCGAACC GGAGGCGATG ACCTGGAGGA ACATGTTCAC GTGATTCAGG 2460 
TTATGAATAG GCATCCCTTA TTCCTTTTAT TGGGTGGCGC GTGCCGCTTC CCTTGATCGG 2520 
GTCAGGTTGC CGCTACTGTG GAAGAAGCGT CGAGGACTCG ATAGATAGCG CCCGAGTGTT 2580 
TCAACTTGTC TTCTGGATGA CGTTTTCATC GGGGAAACCT CCCGTCGGTC AGTGAATCGC 2640 
AAGGGCTGGC GTGCGAGGGT GGAATCGGCC GCCGGCTCGC TTTCTGCGCG GCGGGCGCAC 2700 

2710 2720 2730 2740 2750 2760 

GGCACGGGGA GTCGTCGTTT TGGAGGTGAG GGATGACGGC TCTGTTTCAG GATATTTTTA 2760 
TAATTATGTG AAAGAAGAGC TTATTTCAAC GAAATATGTT TCATATTGCT CGTTAAATTC 2820 
GACGAAAAGA AAATCCGGAT ATTTACCGGT TATTTAACGC TAATACCAAG TGCCTAATAC 2880 
CAAAGTATTA ACGCTGGTAT GCCGGCATGT CGTGTTCGGT CGTGGAGCGA GCCGAGCTAG 2940 
GGACGGTTCT AATAAACCAA AAAATTATGT CGCGTACGTC TAACGACCGA AACCTATGTC 3000 



Fig. 7C 
Sequence: 34B12 EcoRI fragment From: 1 To: 4590 

3010 3020 3030 3040 3050 3060 

TCTTGTTAGC GTAGCCACCG GCCAGGCCGG TACGGACCCG GGATGGCCCT GGCGCGACCT 3060 
ATGCGGTTAG AATCCGCGGC CTTGCAGGCG GATACCCGAG CTTCGCTCGA AGGTGTCGCG 3120 
GTGCCGTGCC GTGGAATCGG CCGCCGGCTC GCTTTCTGCG CGGCGGGCGC ACGGCGACGG 3180 
GGAGTCGTCG TTTTGGAGGT GAGGGATGAC GGCTCTGTTT CAGGATATTT TTATAATTAT 3240 
GTGAAAGAAG AGCTTATTTC AACGAAATAT GTTTCATATT GCTCGTAAAT TCGACGAAAA 3300 

3310 3320 3330 3340 3350 3360 

GAAAATCCGG ATATTTACCG GTTATTTAAC GTTAATACCA AGGGCCTAAT ACCAAAGTAT 3360 
TAACGCTGGC ATGCCGGCAT GTCGTGTTCG GTCGTGGAGC GAGCCGAGCC AGGAACGGTT 3420 
CTAAGAAACG AAAAAATTAT GTCGCGTAGG TCTAACGACC GAAACCTATG TCTTTTGTTA 3480 
GCGTAGCCAC CGGCCAGGCC GGTACGGATG CCGGGATGGC CCTGGCGCGA CCTATGCGGT 3540 
TAGAATCCGC GGCCTTGCAG GCGGATCCCC GGGGTTTGCT CAAGGGGACA CGGGTGCCGT 3600 

3610 3620 3630 3640 3650 3660 

GCCCGAAACC TGCAATCGTC AGTTCCCTGC GGTCCAGCCT GCCGCCGGGT ATAAAATCGA 3660 
GAGACGCGCT GTTGCGCCTT CAGGTGTAGC GACTATGACG CACATTTCCG AACGACTCCT 3720 
GGTACAGGCC CACCTGGCCG CCAAGCAACC CCGTGTGTTG AGCGAGCAGG AGAGCGCCGA 3780 
GCATCGCGCG GCGATCGCGG CCGAACTGAA GGCGCAAAAT GCTGTACTGG TGGCGCATTA 3840 
CTACTGCGAC CCGGTGATCC AGGCGTTGGC CGAGGAGACC GGCGGTTGCG TATCCGATTC 3900 

3910 3920 3930 3940 3950 3960 

GCTGGAGATG GCCCGTTTCG GCAACCAGCA TCCGGCGCAG ACGGTGGTCG TGGCCGGGGT 3960 
GCGCTTCATG GGCGAGACGG CGAAGATCCT CAACCCTGAG AAGCGTGTGC TGATGCCGAC 4020 
CCTCGAAGCG ACCTGCTCGC TCGACCTGGG ATGCCCGGTG GATGAATTCT CGGCTTTCTG 4080 
CGACCAGCAC CCGGAACGGA CCGTGGTGGT CTATGCGAAC ACCTCCGCGG CGGTGAAGGC 4140 
ACGCGCCGAC TGGGTCGTGA CCTCCAGTTG CGCGGTGGAG ATCGTCGAAC ACCTGATGGA 4200 

4210 4220 4230 4240 4250 4260 

CAACGGCGAG CCCATCCTCT GGGCGCCGGA CCAGCACCTG GGACGCTACA TCCAGCGCGA 4260 
GACGGGGGCC GACATGCTGC TCTGGGATGG CGCCTGTATC GTCCACGAGG AGTTCAAGGC 4320 
CAAGCAGCTG GAAGACATGA AGGCGCTCTA CCCGGACGCC GCCATCCTGG TCCACCCCGA 4380 
ATCGCCGGAA AGCGTGGTCG CGCTGGCCGA TGCCGTGGGC TCGACCAGCC AGTTGATCAA 4440 
GGCCGCGCAG ACCCTGCCGA ACAAGACCTT CATCGTCGCC ACCGATCGCG GCATCTTCTA 4500 



Fig. 7D 
Sequence: 34B12 EcoRI fragment From: 1 To: 4590 

4510 4520 4530 4540 4550 4560 

CAAGATGCAG CAGTTGTGCC CGGACAAGGA TTTCATCGAG GCCCCCACCG CCGGCAACGG 4560 
CGCCGCCTGC CGCAGTGCGC GCACTGCCCG 4590 



Fig. 7E 
Sequence: 34B12 ORF 1 L-S From: 1 To: 1284 

10 20 30 40 50 60 

ATGGCAAGAG CACGTCCCTG CGTTTCGATC AGCGTCACCC GCTGTTCGAC GGCATCGCTG 60 
ACCTGCGCGT CGCGCGCTAC CACTCGCTGG TGGTCAGTCG CCTGCCGGAA GGTTTCGACT 120 
GCCTGGCCGA TGCCGATGGC GAGATCATGG CGATGGCCGA TCCGCGCAAT CGACAGCTGG 180 
GCTTGCAATT CCATCCCGAG TCGATTCTCA CCACCCACGG CCAGCGTCTG CTGGAGAACG 240 
CTCTACTCTG GTGCGGCGCG TTGGCGGTCG CGGAGCGCCT TCGGGCCTGA GCGGCGCTGC 300 

310 320 330 340 350 360 

GCAGTTTCGA CCGAGGCTCG GTTGCCAGGC CGGCGCATCG TCGAAACGCT GGCGGCCCAG 360 
TTCGCGCAGG CGCTGGCGGG CGCTTTCGAG AAAGCGACGG AAGCTGCGCT CGGATTCCAG 420 
CGCGGTGTTG TAGTAGCAAT ACACCTTGGT GTCGATGCCG CCCGGTTCGT ACAGTTCGCT 480 
GAGGACTGCC AGGGTACCGT TGCGCAGGCG TTCCTCGACG AAATAATGCG GCGAGATGCC 540 
CCATCCGACG CCGGCTTCCA CCAGACGCAG CATGTCGTCG AAGTTTTCCA CGAAGAGCAC 600 

610 620 630 640 650 660 

CTTGTCGCTG ACCGGCCGCA GCAGGTTCGA ATGCTGCCCG GAGCGGCTGC CGAGGCTGAT 660 
CTGCCGGTAA TTGGCCAGGC TCGCGATGCT GTGCAGGGAG GCATTGCACA ACGGGTGCTG 720 
CGGATGGGCG ACGACGAACG CCTTGGTGTA GCCGAGCACG CACTGGTTGA AGCGGGAGAT 780 
CTTCAGTTCC TCGTCGATGG TGATGGCGAT ATCGATTTCC GCGTTGTCCT GCTTGATCGT 840 
CGCCAGGCTA TCGGCGGGCG AGGTGCGTAT CAGGCTGACC ATGTTGAAAT CGTCGAGCAG 900 

910 920 930 940 950 960 

TACGCTGCTC ACCGTATCGC AGAACGACGG CGGGATGGCG GTGTCCAGCA GCACCCGGAG 960 
ATTGCGCGGA CCCTTGTTGA GATTGAAGGC GATGTCGCCG ATCAGCTGCT GGTAGTTCAG 1020 
CAGGCTGCGC ATGTAAGGGA TCAGGCGAAG CGCCTGCTCG GTGGGTTCGA CCTTGTAGCC 1080 
GTCCCGACGG ACCAGCTCCA CGCACAGGTC GATTTCCAGG TTGCTGACCG CCGAGCTGAC 1140 
CGCGGTGTGC GACTTGCGCA GGATCCGCGC AGCGGAGGAA ATCGAACCGG AGGCGATGAC 1200 

1210 1220 1230 1240 1250 1260 

CTGGAGGAAC ATGTTCACGT GATTCAGGTT ATGAATAGGC ATCCCTTATT CCTTTTATTG 1260 
GGTGGCGCGT GCCGCTTCCC TTGA 1284 



Fig. 7F 
Sequence: 34B12 ORF 1 (L-S) PROTEIN From: 1 To: 427 

10 20 30 40 50 

MARARPCVSI SVTRCSTASL TCASRATTRW WSVACRKVST AWPMPMARSW 
RWPIRAIDSW ACNSIPSRFS PPTASVCWRT LYSGAARWRS RSAFGPERRC 
AVSTEARLPG RRIVETLAAQ FAQALAGAFE KATEAALGFQ RGVVVAIHLG 
VDAARFVQFA EDCQGTVAQA FLDEIMRRDA PSDAGFHQTQ HVVEVFHEEY 
LVADRPQQVR MLPGAATEAD LPVIGQARDA VQGGIAQRVL RMGDDERLGV 

260 270 280 290 300 

AEHALVEAGD LQFLVDGDGD IDFRVVLLDR RQAIGGRGAY QADHVEIVEQ 
YAAHRIAERR RDGGVQQHPE IARTLVEIEG DVADQLLVVQ QAAHVRDQAK 
RLLGGFDLVA VPTDQLHAQV DFQVADRRAD RGVRLAQDPR SGGNRTGGDD 
LEEHVHVIQV MNRHPLFLLL GGACRFP 427 



Fig. 7G 
Sequence: 34B12 ORF 2 From: 1 To: 1035 

10 20 30 40 50 60 

ATGCCTATTC ATAACCTGAA TCACGTGAAC ATGTTCCTCC AGGTCATCGC CTCCGGTTCG 60 
ATTTCCTCCG CTGCGCGGAT CCTGCGCAAG TCGCACACCG CGGTCAGCTC GGCGGTCAGC 120 
AACCTGGAAA TCGACCTGTG CGTGGAGCTG GTCCGTCGGG ACGGCTACAA GGTCGAACCC 180 
ACCGAGCAGG CGCTTCGCCT GATCCCTTAC ATGCGCAGCC TGCTGAACTA CCAGCAGCTG 240 
ATCGGCGACA TCGCCTTCAA TCTCAACAAG GGTCCGCGCA ATCTCCGGGT GCTGCTGGAC 300 

310 320 330 340 350 360 

ACCGCCATCC CGCCGTCGTT CTGCGATACG GTGAGCAGCG TACTGCTCGA CGATTTCAAC 360 
ATGGTCAGCC TGATACGCAC CTCGCCCGCC GATAGCCTGG CGACGATCAA GCAGGACAAC 420 
GCGGAAATCG ATATCGCCAT CACCATCGAC GAGGAACTGA AGATCTCCCG CTTCAACCAG 480 
TGCGTGCTCG GCTACACCAA GGCGTTCGTC GTCGCCCATC CGCAGCACCC GTTGTGCAAT 540 
GCCTCCCTGC ACAGCATCGC GAGCCTGGCC AATTACCGGC AGATCAGCCT CGGCAGCCGC 600 

610 620 630 640 650 660 

TCCGGGCAGC ATTCGAACCT GCTGCGGCCG GTCAGCGACA AGGTGCTCTT CGTGGAAAAC 660 
TTCGACGACA TGCTGCGTCT GGTGGAAGCC GGCGTCGGAT GGGGCATCTC GCCGCATTAT 720 
TTCGTCGAGG AACGCCTGCG CAACGGTACC CTGGCAGTCC TCAGCGAACT GTACGAACCG 780 
GGCGGCATCG ACACCAAGGT GTATTGCTAC TACAACACCG CGCTGGAATC CGAGCGCAGC 840 
TTCCGTCGCT TTCTCGAAAG CGCCCGCCAG CGCCTGCGCG AACTGGGCCG CCAGCGTTTC 900 
910 920 930 940 950 960 

GACGATGCGC CGGCCTGGCA ACCGAGCCTC GGTCGAAACT GCGCAGCGCC GCTCAGGCCC 960 
GAAGGCGCTC CGCGACCGCC AACGCGCCGC ACCAGAGTAG AGCGTTCTCC AGCAGACGCT 1020 
GGCCGTGGGT GGTGA 1035 



Fig. 7H 
Sequence: 34B12 ORF 2 PROTEIN From: 1 To: 344 

10 20 30 40 50 60 

MPIHNLNHVN MFLQVIASGS ISSAARILRK SHTAVSSAVS NLEIDLCVEL VRRDGYKVEP 60 
TEQALRLIPY MRSLLNYQQL IGDIAFNLNK GPRNLRVLLD TAIPPSFCDT VSSVLLDDFN 120 
MVSLIRTSPA DSLATIKQDN AEIDIAITID EELKISRFNQ CVLGYTKAFV VAHPQHPLCN 180 
ASLHSIASLA NYRQISLGSR SGQHSNLLRP VSDKVLFVEN FDDMLRLVEA GVGWGIAPHY 240 
FVEERLRNGT LAVLSELYEP GGIDTKVYCY YNTALESERS FRRFLESARQ RLRELGRQRF 300 
310 320 330 340 350 360 

DDAPAWQPSL GRNCAAPLRP EGAPRPPTRR TRVERSPADA GRGW 344 



Fig. 7I 
Sequence: 34B12 ORF 1 From: 1 To: 759 

10 20 30 40 

ATGCGGCGAG ATGCCCCATC CGACGCCGGC TTCCACCAGA 40 
CGCAGCATGT CGTCGAAGTT TTCCACGAAG AGCACCTTGT 80 
CGCTGACCGG CCGCAGCAGG TTCGAATGCT GCCCGGAGCG 120 
GCTGCCGAGG CTGATCTGCC GGTAATTGGC CAGGCTCGCG 160 
ATGCTGTGCA GGGAGGCATT GCACAACGGG TGCTGCGGAT 200 

210 220 230 240 

GGGCGACGAC GAACGCCTTG GTGTAGCCGA GCACGCACTG 240 
GTTGAAGCGG GAGATCTTCA GTTCCTCGTC GATGGTGATG 280 
GCGATATCGA TTTCCGCGTT GTCCTGCTTG ATCGTCGCCA 320 
GGCTATCGGC GGGCGAGGTG CGTATCAGGC TGACCATGTT 360 
GAAATCGTCG AGCAGTACGC TGCTCACCGT ATCGCAGAAC 400 

410 420 430 440 

GACGGCGGGA TGGCGGTGTC CAGCAGCACC CGGAGATTGC 440 
GCGGACCCTT GTTGAGATTG AAGGCGATGT CGCCGATCAG 480 
CTGCTGGTAG TTCAGCAGGC TGCGCATGTA AGGGATCAGG 520 
CGAAGCGCCT GCTCGGTGGG TTCGACCTTG TAGCCGTCCC 560 
GACGGACCAG CTCCACGCAC AGGTCGATTT CCAGGTTGCT 600 

610 620 630 640 

GACCGCCGAG CTGACCGCGG TGTGCGACTT GCGCAGGATC 640 
CGCGCAGCGG AGGAAATCGA ACCGGAGGCG ATGACCTGGA 680 
GGAACATGTT CACGTGATTC AGGTTATGAA TAGGCATCCC 720 
TTATTCCTTT TATTGGGTGG CGCGTGCCGC TTCCCTTGA 759 



Fig. 7J 
Sequence: 34B12 ORF 1 S PROTEIN From: 1 To: 253 

10 20 30 40 

MRRDAPSDAG FHQTQHVVEV FHEEHLVADR PQQVRMLPGA 40 
AAEADLPVIG QARDAVQGGI AQRVLRMGDD ERLGVAEHAL 80 
VEAGDLQFLV DGDGDIDFRV VLLDRRQAIG GRGAYQADHV 120 
EIVEQYAAHR IAERRRDGGV QQHPEIARTL VEIEGDVADQ 160 
LLVVQQAAHV RDQAKRLLGG FDLVAVPTDQ LHAQVDFQVA 200 

210 220 230 240 

DRRADRGVRL AQDPRSGGNR TGGDDLEEHV HVIQVMNRHP 240 
LFLLLGGACR FP. 253 



Fig. 7K 
pho34B12 ORF1 (L-S) SEQ ID NO:107 

1 MARARPCVSI SVTRCSTASL TCASRATTRW WSVACRKVST AWPMPMARSW 
51 RWPIRAIDSW ACNSIPSRFS PPTASVCWRT LYSGAARWRS RSAFGPERRC 
101 AVSTEARLPG RRIVETLAAQ FAQALAGAFE KATEAALGFQ RGVVVAIHLG 
151 VDAARFVQFA EDCQGTVAQA FLDEIMRRDA PSDAGFHQTQ HVVEVFHEEY 
201 LVADRPQQVR MLPGAATEAD LPVIGQARDA VQGGIAQRVL RMGDDERLGV 
251 AEHALVEAGD LQFLVDGDGD IDFRVVLLDR RQAIGGRGAY QADHVEIVEQ 
301 YAAHRIAERR RDGGVQQHPE IARTLVEIEG DVADQLLVVQ QAAHVRDQAK 
351 RLLGGFDLVA VPTDQLHAQV DFQVADRRAD RGVRLAQDPR SGGNRTGGDD 
401 LEEHVHVIQV MNRHPLFLLL GGACRFP* 

Fig. 8 

phoB12 ORF2 SEQ ID NO: 108 

1 MPIHNLNHVN MFLQVIASGS ISSAARILRK SHTAVSSAVS NLEIDLCVEL 
51 VRRDGYKVEP TEQALRLIPY MRSLLNYQQL IGDIAFNLNK GPRNLRVLLD 
101 TAIPPSFCDT VSSVLLDDFN MVSLIRTSPA DSLATIKQDN AEIDIAITID 
151 EELKISRFNQ CVLGYTKAFV VAHPQHPLCN ASLHSIASLA NYRQISLGSR 
201 SGQHSNLLRP VSDKVLFVEN FDDMLRLVEA GVGWGIAPHY FVEERLRNGT 
251 LAVLSELYEP GGIDTKVYCY YNTALESERS FRRFLESARQ RLRELGRQRF 
301 DDAPAWQPSL GRNCAAPLRP EGAPRPPTRR TRVERSPADA GRGW* 

Fig. 9 

36A4 SEQ ID NO: 109 

1 AAGGGTTTTG GCGGGGTCAT CCGAGTGACC CTGAGCATGC TCCTGGCGAT 
51 CTTCTTGTCG GTGCTGCTGG CGCCGGTGCG CATGCTGTTC CACACCCGCT 
101 TCGTGCTGGC CGCCTTCCTC GGCTGGTC 

Fig. 10 

36A4 SEQ ID NO:110 

1 KGFGGVIRVT LSMLLAIFLS VLLAPVRMLF HTRFVLAAFL GW 



Fig. 11 
contig 2507 SEQ ID NO: 111 

1 CTACTGGGGC AAGCTGAAGA CGCCGTTCAA GCTGAGCTTC TATCACCAGG 

51 GCATGCACTT CGACACGCCG GTGAAGATCA ACGAGGTGAC CGCTACCACG 

101 GTCAAGCCGA TCAAGTACGA TCGCACCAAG TTCGATTTCG GATCCCTGAA 

151 GTTCGACGAG AATGCCACCA AGGATCTCGG CTATGCCGGT TTCCGCGTGC 

201 TCTATCCGAT CAACAAGGCC GACAAGCAGG ACGAGATCGC CACCTTCCTT 

251 GGCGCGAGCT ACTTCCGCGT GGTCGGCAAG GGCCAGGTCT ACGGTCTGTC 

301 GGCGCGCGGC CTGGCGATCG ATACCGCGCT GCCTTCGGGC GAAGAGTTCC 

351 CGCGCTTCCG CGAATTCTGG ATCGAGCGGC CGAAGCGCAG GACAAGCAAC 

401 TGGTGATCTA CGCCCTGCTC GACTCGCCGC GGGCCACCGG CGCCTACCGC 

451 TTCGTGCTGC GTCCGGGCAA GGATGCGGTG ATGGATGTCC AGGCCCGCGT 

501 GTTCCTCCGC GACAAGGTCA GCAAGCTGGG CCTGGCGCCG CTGACCAGCA 

551 TGTACCTGTT CGGCTCCAAC CAGCCGTCCG AGCAGCACAA CTTCCGGCCC 

601 GAGCTGCATG ACTCCAGCGG CCTGCAGATC CATGCCGGCA ACGGCGAGTG 

651 GCTGTGGCGT CCGCTGAACA ATCCGAAGCA CCTGTCGGTG AGCACCTTCA 

701 GCGTGGAGAA CCCGAAAGGC TTCGGCCTGC TCCAGCGCGG CCGCGAGTTC 

751 TCCCGCTACG AAGACCTGGA TGACCGCTAC GACCTGCGTC CGAGTGCCTG 

801 GATCGAGCCG AAGGGCGACT GGGGCAAGGG CACCGTGGAA CTGGTGGAAA 

851 TCCCGACCCC GGACGAAACC AACGACAATA TCGTCGCGTT CTGGAACCCC 

901 GAGACCCAGC CTGAGGTCGG AAAGCCGCTG GACTTCGCCT ACCGCCTGCA 

951 CTGGACCATG GATGAAGACG AGCTGCACGA CCCGAAATCC TCCTGGGTCA 

1001 AGCAGACCAT GCGCTCGGTC GGCGACGTGA AGCAGAAGAA CCTGATCCGC 

1051 CAGCAGGACG GCAGCACCGC CCTGGTCGTC GACTTCGAAG GGCCGGCCCT 

1101 GAAGGACCTG GCGCCGGACG CGCCGGTGAC CACCCAGGTC AGCACCGACA 

1151 GCAACGCCGA GGTGGTGGAG AACAGCCTGC GTTACAACCC GGTCCTGAAA 

1201 GGCTGGCGCC TGACGCTGCG GATCAAGGTC AAGGATCCGA AGAAGCCGGT 

1251 GGAAATGCGC GCGGCGCTGG TCGACGAGGC GCAGAAGCCA CTGAGCGAAA 

1301 CCTGGAGCTA TCAGCTGCCT GCCGATGAAT AACCCATCCA CTACGAAAGC 

1351 ACCGCTGGCC GACTACCTCG CTCATCTTCC CCTGGCGGAA GAGGAGCGGG 

1401 AGCGCCTTGG CGAGTCCGCT TCCTTCTCCG AGCTGCACGC TCGCCTGGCG 

1451 GGAGCGGAAG GCGCCGCTGC CGATGCCGGG GGCGATCCCG CCCTGGCCTC 

1501 GGTACGCGCC CGCCTGCAGC TGGGCACCCC TGAGCTGGAC GACGCCGAGA 

1551 TGTTCGGCGT CGACGCCCAG GGTCGCACCT TCCTCAAGAT TTCCCCGCCG 

1601 ATCCGCCGTA CCAAGGTGAT TCCCGAGCCC TGGCGCACCA ACATCCTGGT 

1651 GCGCGGCTGG CGTCGGCTGA CCGGACGCAG CAACCCGCCC AAGCCCAAGC 

1701 GTGCCCTGCC GCGGGCCCGC TGGCAGCGGG TCGGCTCGCT GCGCCGGTTC 

1751 ATCCTGCTGT TGTTGATGCT GGCGCAGACC TCGGTCGCCA CCTACTACAT 

1801 GAAAGGCATC CTGCCCTACC AGGGCTGGGC CTTCGTCGAC CTGGAGGAGC 

1851 TGGCCCAGCA GAGCCTGCTG GATACCGTCC AGCAGGTGCT GCCCTATGTC 

1901 ATCCAGTTCG GCATCCTGGC GCTCTTCGCG ATCCTCTTCT GCTGGGTCTC 

1951 GGCCGGCTTC TGGACCGCGC TGATGGGCTT CTGGGAGCTG CTCACCGGGC 

2001 GTGACCGCTA CCGGATCTCC GGCAGCAGCG CCGGCAGCGA GCCGATCGCC 

2051 GCCGACGCCC GCACGGCGAT CGTCATGCCG ATCTGCAACG AAGACGTGCC 

2101 GCGGGTATTC GCCGGCCTGC GGGCGACCGT CGAGTCGATG GCCGCCACCG 

2151 GCGAGATGGA GCGCTTCGAC TTCTTCGTCC TCAGCGACAC CAACGACCCG 

2201 GATATCGCCG TCGCCGAGCA GCAGGCCTGG CTCGAGCTGT GCCGCGAGAC 

2251 CAAGGGCTTC GGCAAGATCT TCTACCGTCG CCGCCGGCGC CGGGTGAAGC 

2301 GCAAGAGCGG CAACATCGAC GACTTCTGCC GGCGCTGGGG CGGCGACTAC 

2351 CGCTACATGG TGGTGATGGA CGCCGACAGC GTGATGAGCG GCGACTGCCT 



Fig. 12A 
2401 GGCCAAGCTG GTACGCCTGA TGGAGGCCAA TCCTGAGGCG GGGATCATCC 
2451 AGACCGCGCC GAAGGCTCCG GCATGGACAC CCTGTATGCG CGCATGCAGC 
2501 AGTTCGCCAC CCGCGTCTAC GGCCCGCTGT TCACCGCCGG CCTGCACTTC 
2551 TGGCAACTCG GCGAGTCGCA CTACTGGGGC CACAACGCGA TCATCCGCAT 
2601 GCAGCCCTTC ATCGACCACT GCGCCCTGGC GCCGTTGCCG GGCAAGGGCT 
2651 CGTTCGCCGG CGCGATCCTG TCCCACGACT TCGTCGAGGC TGCGTTGATG 
2701 CGCCTTGCCG GCTGGGGCGT GTGGATCGCC TACGACTTCG ACGGCAGCTA 
2751 CGAAGAACTG CCGCCGAACC TGCTCGACGA ACTCAAGCGC GACCGCCGCT 
2801 GGTGCCACGG CAACCTGATG AACTTCCGCC TGTTCCTGGT CAAGGGCATG 
2851 CACCCGGTGC ACCGCGCGGT GTTCCTCACC GGGGTCATGT CCTACCTGTC 
2901 GGCGCCGTTG TGGTTCTTCT TCCTGGTGCT TTCCACCGCG CTGCTGGCGG 
2951 TGCACCAACT GATGGAGCCG CAGTACTTCC TGGAACCGCG GCAGCTGTTC 
3001 CCGATCTGGC CGCAGTGGCA TCCGGAGAAG GCCATCGCGT TGTTCTCCAC 
3051 CACCCTGACC CTGTTGTTCC TGCCCAAGCT GCTCAGCGTA ATGCTGATCT 
3101 GGGCCAAGGG CGCCAAGGGT TTCGGCGGGG TGATCCGGGT GACCCTGAGC 
3151 ATGCTCCTGG AGATGTTCTT CTCGGTGCTG CTGGCGCCGG TGCGCATGCT 
3201 CTTCCACACC CGCTTCGTGC TGGCCGCCTT CCTCGGCTGG TCGGTGCAGT 
3251 GGAACTCGCC GCAGCGCGAC GACGACGCCA CGCCCTGGAG CGAGGCGATC 
3301 CGCCGGCACG CAATGCAGAC CCTGCTGGGT ATCGCCTGGA CCCTGCTGGT 
3351 GGCCTGGCTC AACCCGCGCT TCCTGTGGTG GCTGTCGCCG ATCGTCGGTT 
3401 CGCTGATCCT GTCGATCCCG GTATCGGTGA TCTCCAGCCG GGTGAAGCTG 
3451 GGCCTGCGGG CCCGCTACGA AAAGCTGGTC CTGATCCCGG AGAGTACGAC 
3501 ACGCCGCGCG ACTGCGCGCC ACCGACGAGT ACACCTACGA GAACCGCTGG 
3551 CATGCGCTCA AGGATGGCTT CCTCAAGGCC GCCGTCGATC CGTTGCTCAA 
3601 CGCCCTGGCC TGCGCCATGG GCACGGCTCG CCACAACCGT GCGCAGGCCA 
3651 TCGAGACGGT GCGTGGCGAG CGTATCGGCA AGGCCATCGA TAAGGGCCCG 
3701 GAACAGCTCG ACGGCGCCAC GCGCCTGGCT CTGTTGAGTG ACCCGGTAGC 
3751 ACTTTCGCGC CTGCATACGC GGGTCTGGGA AGAGGACCGC GACGACTGGC 
3801 TCGGCCGCTG GCGCAAGGCC GAGGCGGACG ACCCCCACGC CGCCAGCGTA 
3851 CCGCTGGCCC AGGTAGTGCC CGGCGACGCC GGCCTGCTGC CCGCCGCCCA 
3901 GTCCTGATCC CATGCCCCCG GCGGAACGCC GCCGGGGGCA TGGGTCTGTT 
4001 TCTTGCCTGT TTTCCCCGTG CGGCGCTGCT GTTACCCTGC GCCGGCAATC 
4051 CAGAAAGTCT CGTATCGTTC GCCAGCTGAG GTACTATCGG CCGCCTTTTG 
4101 CGCAGCCGGT CATGGCCTGC TGCCCGCCCG GGACGGCGAC ACGACGAGAG 
4151 CATCCGTTCG ACGACTGTGT TTCTAAGACT GCTGGGGATT GGGGAATGAA 
4201 AAAGTATCTT GCTTCATTGG TTCTGGGCGT CTGCGCCCTG GTGGGCGTGG 
4251 CTTCGGTCCA GGCGGCCGGC GCGGTGGAGG ACGCGGTCAA GCGCGGCACC 
4301 CTGCGGGTCG GCATGGACCC GACCTACATG CCGTTCGAGA TGACCAACAA 
4351 GCGTGGCCAG ATCATCGGCT TCGAAGTCGA CCTGCTCAAG GCCATGGCCA 
4401 AGTCCATGGG CGTCAAGCTG GAGCTGGTCT CCACCAGCTA CGACGGCATC 
4451 ATCCCGGCGC TGCTGACCGA CAAGTTCGAC ATGATCGGCT CGGGCATGAC 
4501 CCTGACCCAG GAGCGCAACC TGCGCCTGAA CTTCTCCGAG CCCTTCATCG 
4551 TGGTCGGCCA GACCCTGCTG GTGCGCAAGG AACTGGAAGG CAAGATCAAG 
4601 TCCTACAAGG ACCTGAACGA TCCGCAGTAC AGCATCACCT CGAAGATCGG 
4651 CACCACCGGT GAGATCGTTG CCCGCAAGCT GATCAGCAAG GCCAAGTACC 
4701 ACGGCTTCGA CAACGAGCCG GAAGCGGTGA TGGACGTGGT CAACGGCAAG 
4751 GCCGACGCCT TCATCTACGA CTCGCCCTAC AACGTGGTGG CGGTGAGCAA 
4801 GTTCGGCGCC GGCAAGCTGG TCTACCTCGA CCAGCCGTTC ACCTACGAGC 
4851 CGCTGGCGTT CGGCCTGAAG AAAGGCGACT ACGACAGCAT CAATTTCATC 
4901 AACAACTTCC TCCATCAGAT CCGCGAAGAC GGCACCTATC AGCGCATCCA 

Fig. 12B 
4951 CGACAAGTGG TTCAAGAACA CCGAGTGGCT GAAGGAAATG GAATGAACCG 
5001 CTGACGGCCC CCGCGAAGGG GGCCGTCGTA CCTGCGCATT CCATCGTTCG 
5051 AGAGAGTTTC CGTGACCAAG AAGAAACGTT CCGTCTGGCC CTGGCACCTG 
5101 CTGACCGGGC TGATCCTGCT GGTCATGGCC TGGGCGCTGT GGTTCTCCAC 
5151 CTCGCTGATT TCCTATGAAA TGGCGTGGGA CCGCGTTTCC GAGTACTTCG 
5201 CTACCAGGCC GAGGAGCCGT TACGGGCCAA CGAGATCGGC CGGGTCGAGG 
5251 CTATCGAGGA ACAGGGCAGG GACGCGCGCG TCACGCTGCT TGGCGAGACG 
5301 GCGAGAAGCA GGTCGTGACC GTTGCCCAGG ACAGCCTGCA ATTCTCCGAA 
5351 GCGACGACGT GGCCGAGGGC GACGCGGTCG GGGTGACCCG CCACTGGGCC 
5401 GCCGGCACTG CTCTGGGGCC TGTGGACCAC CCTCTGGCTA TCGCTGGTGT 
5451 CCGGTGCCAT CGGTCTGGCT ATCGGCCTGG TCGCCGGCCT CTGCCGGCTG 
5501 TCGAAGAACC CGACCCTGCA CGACCTGTCG ACGATCTACG TCGAGCTGGT 
5551 GCGCGGCACG CCGTTGCTGG TGCAGATCTT CATCTTCTAC TTCTTCATCG 
5601 GCACCGTGCT CAACCTGTCC CGCGAGTTCG CCGGGGTTGC GGCGCTGGCG 
5651 CTGTTCACCG GCGCCTACGT GGCCGAGATC ATCCGGGCCG GCGTGCAGTC 
5701 CATCGCCCGC GGACAGAACG AGGCCGCCCG CTCCCTGGGC CTGAACGCCG 
5751 GCCAGTCGAT GCGCTACGTG ATCCTGCCGC AGGCTTCAAG CGCGTGCTGC 
5801 CGCCGCTGGC CGGGCAGTTC ATCAGCCTGG TCAAGGACAC CTCGCTGGTC 
5851 TCGGTGATCG CCATCACCGA ACTGACCAAG AGCGGCCGCG AGGCGATCAC 
5901 CCACTTCGTT CTCCAACTTC GAGATCTGGT TTCTGCGTCG CCGCGTTGTA 
5951 CCTGCTGTTG AACCTGCCCC TTTCGCACAT GGCATCCCGA CTGGAGCGGA 
6001 GGCTCGGACA AAGTGATTGA AGTACGCAAC CTGCTGAAGG TCTTCGATAC 
6051 CCGCGGCCAG GTAGTGCGCG CGGTGGACGA CGTGAGTACC CGCGTGGCCA 
6101 GGGGCGAGGT ACTGGTGGTG ATCGGTCCGT CCGGTTCCGG CAAGTCGACC 
6151 TTCCTGCGCT GCCTGAACGG CCTGGAGGAG TTCGACGAAG GCTCGGTGAG 
6201 CATCGACGGC GTCGACCTGG CCGACCCGAG GACCGACATC AATGCCTACC 
6251 GCCGCGAAGT CGGCATGGTG TTCCAGCATT TCAACCTGTT CCCGCACATG 
6301 ACCGTGCTCG AGAACCTCTG CCTGGCCCAA CGCGTGGTGC GCAAGCGCGG 
6351 CAAGGCCGAG CGCGAGGCCA AGGCGCGGGC GCTGCTGGCC AAGGTCGGCA 
6401 TCGGGCAGAA GGCCGACGAA TATCCCTCGC GCCTGTCCGG CGGCCAGCAG 
6451 CAGCGCGTGG CGATCGCTCG CGCGTTGTGC ATGGACCCCA AGGTGATGCT 
6501 GTTCGACGAA CCGACCTCGG CGCTCGATCC GGAGATGGTC GGCGAAGTCC 
6551 TCGACGTCAT GAAGACCCTG GCCGTGGAAG GCATGACCAT GGTCTGCGTG 
6601 ACCCACGAGA TGGGCTTTGC CCGCGAAGTG GCCGACCGCG TGCTGTTCTT 
6651 CGACCACGGC AAGCTGCTGG AGGACGCGCC GCCGGCGCAG TTCTTCGACA 
6701 ATCCGCAGGA CCCGCGGGCC CAGGCCTTCC TCCGCCAGGT CCTCTAGTAC 
6751 CGCGCTAGGC GAACGGCTTG CCCGGCGGCG GCAGGAGCGA CGTCGGACTC 
6801 TGCCGCGCGG CCGGCTGGAT ATCGTTGTCC TCCAGCCAGT CCAGCGCCCA 
6851 TTCGCGCAGG CGCTCGTTCT GGTAGCGGTA CCAGTCCTGC AACAGTTCCG 
6901 GGTACTCCAT CAGAGAGTGC TTGAAGGCCT TGAACGGCTT GCGGCTCTGC 
6951 AGCGCGTTG 



Fig. 12C 
23A2 DNA SEQ ID NO:112 

1 CGAGGTTTCC GTCTACGAAG GCACCGGCTC GGTCACCATC CGCGCCGTGT 
51 TCCCCAACCC GAACAACGAG CTGCTCCCCG GCATGTTCGT TCACGCGCAG 
101 TTGCAGG 

Fig. 13 

23A2 peptide SEQ ID NO:113 
1 EVSVYEGTGS VTIRAVFPNP NNELLPGMFV HAQLQ 



Fig. 14A 
SEQ ID NO:148 
DNA flanking the 23A2 locus. 
mexA partial sequence, mexB partial sequence 

1 ggccaggcaa acgcgatggc caccgtgcaa cagctcgacc cgatctacgt cgacgtcacc 
61 cagccgtcca ccgccctgtt gcgcatgcgc cgcgaactgg ccagcggcca gttggagcgc 
121 gccggcgaca acgctgcgaa ggtctccctg aagctggagg acggtagcca atacccgctg 
181 gaaggccgcc tcgaattctc cgaggtttcc gtcgacgaag gcaccggctc ggtcaccatc 
241 cgcgccgtgt tccccaaccc gaacaacgag ctgctgcccg gcatgttcgt tcacgcgcag 
301 ttgcaggaag gcgtcaagca gaaggccatc ctcgctccgc agcaaggcgt gacccgcgac 
361 ctcaagggcc aggctaccgc gctggtggtg aacgcgcaga acaaggtcga gctgcgggtg 
421 atcaaggccg accgggtgat cggcgacaag tggctggtca ccgaaggcct gaacgccggc 
481 gacaagatca ttaccgaagg cctgcagttc gtgcagccgg gtgtcgaggt gaagaccgtg 
541 ccggcgaaga atgtcgcgtc cgcgcagaag gccgacgccg ctccggcgaa aaccgacagc 
601 aagggctgat caaggggatt cgtaatgtcg aagtttttca ttgataggcc cattttcgcg 
661 tgggtgatcg ccttggtgat catgctcgcg ggcggcctgt cgatcctcaa tctgccggtc 
721 aaccagtacc cggccatcgc cccgccggcc atcgccgtgc aggtgagcta cccgggcgcc 
781 tcggccgaga cggtgcagga caccgtggtc caggtgatcg agcagcagat gaacgggatc 
841 gacaatctgc gctacatctc ctcggagagt aactccgacg gcagcatgac catcaccgtg 
901 accttcgaac agggcaccga ccccgacatc gcccaggtcc aggtgcagaa caagctgcaa 
961 ctggccaccc cgctgctgcc gcaggaagtg cagcgccagg ggatccgg 

Fig. 14B 

SEQ ID NO:149 
PA14 mexA 



GQANAMATVQ QLDPIYVDVT QPSTALLRMR RELASGQLER AGDNAAKVSL KLEDGSQYP LEGRLEFSE 
VSVDEGTGS VTIRAVFPN PNNELLPGM FVHAQLQEG VKQKAILAP QQGVTRDLK GQATALVVN 
AQNKVELRV IKADRVIGD KWLVTEGLN AGDKIITEG LQFVQPGVE VKTVPAKNV ASAQKADAA PAKTDSKG 



Fig. 14C 



SEQ ID NO:150 
PA14 mexB 



MSKFFIDRPIFAWVIALVIMLAGGLSILNLPVNQYPAIAPPAIA 

VQVSYPGASAETVQDTVVQVIEQQMNGIDNLRYISSESNSDGSMTITVTFEQGTDPDI 

AQVQVQNRLQLATPLLPQEVQRQGIR 



Fig. 14D 
PAO1 Phenazine operon SEQ ID NO:114

1 GCAAGCTCAA CTCCAGCAAC AAGGCGGAGG CCACCATGAA GGCTTACGCC 
51 ATCGGCCTGC TCAACTGAAT CGACGCCTCG TCGCCTAGCG AGGCCGCCGC 
101 GCAAGCGTCC GGCCATTCAC CGAATGGCCG GATAGCGTTT GCGCCGGTCG 
151 CCTGAGCGCA CGCTTCCCAC CGGCAGCGTT TCCCCGCTGC CCCCTTCGCC 
201 ATTGCGCCCG TCCTCATGTT GTCCGGACGC TAGTCGAACT TTCCGGGCGC 
251 CTGGCAAACC GGCCAAAGAA TAGAACGGAA TCGATGCCCA CACCTTTAAT 
301 TTTTAAGGGT TTTTCCTTTT CAAAAACCGT TATTAAGTTT TCCCCTTTAA 
351 ATCTTGGTAC AACTGGGTTC AGGCGAAACT TCGGTCATGC CATTCGGCAT 
401 TAGTTAAACT TTGAGACTCT CCAAGCGGGA ATTTTTGCCG GAACAGCTTC 
451 ACGGCATTTC TCCGCTTTCA TCCCGATGTT TCTTTCCGTT ATGATTCCAG 
501 TCGATTCGAA CTGCCGGAGT TCCCACCATT CGAGATTACC AACGTTGAAA 
551 AGGGTTTACC GACAACCTGG AATTGCGTCG GCGCAACCGT GCCACGGTCG 
601 AGCACTACAT GCGCATGAAG GGGGCCGAAC GGTTACAGCG GCACAGCCTG 
651 TTCGTCGAGG AGGCTGCGCC GGCAACTGGA CCACGGAAAG CGGCGAACCC 
701 CTGGTTTTCC GGGGCCATGA GAGCCTCAGG CGGCTCGCCG AGTGGCTCGA 
751 GCGCTGCTTC CCCGACTGGG AGTGGCACAA CGTGCGGATC TTCGAGACCG 
801 AGGATCCGAA CCACTTCTGG GTCGAGTGCG ACGGGCGCGG CAAGGCGCTG 
851 GTCCCGGGGT ATCCGCAGGG CTATTGCGAG AACCACTACA TCCATTCCTT 
901 CGAACTCGAG AACGGCCGGA TAAAACGCAA TCGCGAGTTC ATGAACCCGA 
1001 TGCAGAAATT GCGTGCATTG GGAATAGCCG TTCCACAAAT AAAACGTGAC 
1051 GGTATTCCCA CCTGATTAAT GTCTATTCCA ATTCAAGAGG AGATATGACG 
1101 ATGCTCGATA ATGCCATTCC TCAAGGTTTC GAAGACGCCG TGGAGTTGCG 
1151 CAGGAAGAAT CGCGAGACGG TGGTCAAGTA TATGAACACC AAAGGCCAGG 
1201 ATCGCCTGCG CCGCCATGAA CTTTTCGTCG AGGACGGCTG TGGCGGTTTA 
1251 TGGACCACCG ATACCGGCTC GCCCATCGTC ATTCGTGGCA AGGACAAGCT 
1301 GGCCGAGCAC GCGGTGTGGT CGCTGAAATG CTTCCCGGAT TGGGAGTGGT 
1351 ACAACATCAA GGTCTTCGAG ACCGACGATC CCAACCACTT CTGGGTCGAG 
1401 TGCGACGGCC ACGGCAAGAT CCTCTTCCCC GGCTATCCCG AGGGCTACTA 
1451 CGAGAACCAC TTCCTGCATT CCTTCGAGCT GGACGACGGC AAGATCAAGC 
1501 GCAACCGCGA ATTCATGAAC GTCTTCCAGC AATTGCGCGC CCTGAGCATT 
1551 CCGGTCCCGC AGATCAAACG CGAAGGCATT CCCACCTGAG GCCATCCTGG 
1601 AAGGGGTGAA CTATGGACGA TCTATTGCAA CGCGTACGGC GCTGCGAAGC 
1651 GCTGCAGCAA CCCGAATGGG GCGATCCGTC GCGCCTGCGC GACGTGCAGG 
1701 CGTACCTGCG CGGCAGTCCG GCGCTGATCC GCGCCGGCGA CATCCTGGCC 
1751 CTGCGCGCGA CCCTGGCGCG GGTCGCCCGC GGCGAGGCGC TGGTGGTGCA 
1801 GTGCGGCGAC TGCGCCGAGG ACATGGACGA CCACCATGCC GAGAACGTGG 
1851 CGCGCAAGGC CGCCGTGCTG GAACTGCTGG CCGGCGCCCT GCGCCTGGCC 
1901 GGCCGGCGGC CGATAGATCC GCGTCGGGCG CATCGCCGGG CAGTACGCCA 
1951 AGCCGCGTTC CAAGCCGCAC GAGCAGGTCG GCGAGCAGAC CCTGCCGGTC 
2001 TATCGCGGCG ACATGGTCAA CGGCCGCGAG GCCCATGCCG AACAGCGCCG 
2051 GGCCGATCCG CAGCGGATCC TCAAGGGCTA TGCGGCGGCG CGCAACATCA 
2101 T 



Fig. 15 
3E8 sequence SEQ ID NO:115

1 CGGCGCCGAG GATCCGCTGT TCGAGTTAGG CGCAAGCGTC CGGCCATTCA 
51 CGGAATGGCC GGATAGCGTT TGCGCCGGTT GCTTGAGCGC AGCTTCCCAC 
101 CGGCAGGGTT TCCCCGCTGC CCCTTTCGCC ATTGCGCCGT CCTCTTGTTG 
151 TCCGGCACGC TAGTGCAACT TTCCGGACGC TTGGCAAACC GGCCAAAGAA 
201 TAGAACGGAA TCGATGCCCC ACACCTGTAA TTTTTAAGGG GTTATGGCTA 
251 TTGCAAAAAA GCGTTTATAA GTTTGTCCCC TGTCAAATCT GGTTACAACT 
301 GGGTTTCAGG CGAAACATTC GGTCATGGCA ATTCGGCATT AGTTGAAACT 
351 TTGGAGACGC TCCGAAGCGG GCAACTTTTG CCCGGAAAAA GTTTCACGGC 
401 AATTTTTCCG GCCTGTCATC CCGATGTCTT CTTTCCAGTA TGGATGCCAG 
451 TCGATTCGAA CTGGCGGAGA TTCGCACCAT GCGAGAGTAC CAACGGTTGA 
501 AAGGGTTTAC CGACAACCTG GAATTGCGGC GGCGCAACCG TGCCACGGTC 
551 GAGCACTACA TGCGCATGAA GGGGGCCGAA CGGTTGCAGC GGCACAGCCT 
601 GTTCGTCGAG GACGGCTGCG CCGGCAACTG GACCACGGAA AGCGGCGAAC 
651 CCCTGGTTTT CCGGGGCCAT GAGAGCCTCA GGCGGCTCGC CGAGTGGCTC 
701 GAGCGCTGCT TCCCCGACTG GGAGTGGCAC AACGTGCGGA TCTTCGAGAC 
751 CGAGGATCCG AACCACCTCT GGGTCGAGTG CGACGGGCGC GGCAAGGCGC 
801 TGGTCCCGGG GTATCCGCAG GGCTATTGCG AGAACCACTA CATCCATTCC 
851 TTCGAACTCG AGAACGGCCG GATAAAACGC AATCGCGAGT TCACGAACCC 
901 GATGCAGAAA TTGCGTGCAT TGGGAATAGC CGTTCCGCAA ATAAaACGTG 
951 ACGGCATTCC CACCTGATTA ATGATTATTC CAATTCAAGA GGAGATATGA 
1001 CGATGCTCGA TAATGCTATT CCCCAAGGTT TCGAAGACGC CGTGGAGTTG 
1051 CGCAGGAAGA ATCGCGAGAC GGTGGTCAAG TATATGAACA CCAAAGGCCA 
1101 GGATCGCCTG CGCCGCCATG AACTTTTCGT CGAGGACGGC TGTGGCGGTT 
1151 TATGGACCAC CGATACCGGC TCGCCCATCG TCATTCGTGG CAAGGACAAG 
1201 CTGGCCGAGC ACGCGGTGTG GTCGCTGAAA TGCCTTCCCG GATTGGGAGT 
1251 GGTACAACAT CAAGGT 



Fig. 16A 
3E8 SEQUENCE TAG SEQ ID NO:160

1 tatggatgcc agtcgattcg aactggcgga gattcgcacc atgcgagagt accaacggtt 

61 gaaagggttt accgacaacc tggaattgcg gcggcgcaac cgtgccacgg tcgagcacta 

121 catgcgcatg aagggggccg aacggttgca gcggcacagc ctgttcgtcg aggacggctg 

181 cgccggcaac tggaccacgg aaagcggcga acccctggtt ttccggggcc atgagagcct 

241 caggcggctc gccgagtggc tcgagcgctg cttccccgac tgggagtggc acaacgtgcg 

301 gatcttcgag accgaggatc cgaaccacct ctgggtcgag tgcgacgggc gcggcaaggc 

361 gctggtcccg gggtatccgc agggctattg cgagaaccac tacatccatt ccttcgaact 

421 cgagaacggc cggataaaac gcaatcgcga gttcacgaac ccgatgcaga aattgcgtgc 

481 attgggaata gccgttccgc aaataaaacg tgacggcatt cccacctgat taatgattat 

541 tccaattcaa gaggagatat gacgatgctc gataatgcta ttccccaagg tttcgaagac 

601 gccgtggagt tgcgcaggaa gaatcgcgag acggtggtca agtatatgaa caccaaaggc 

661 caggatcgcc tgcgccgcca tgaacttttc gtcgaggacg gctgtggcgg tttatggacc 

721 accgataccg gctcgcccat cgtcattcgt ggcaaggaca agctggccga gcacgcggtg 

781 tggtcgctga aatgcttccc ggattgggag tggtacaaca tcaaggtctt cgagaccgac 

841 gatcccaacc acttctgggt cgagtgcgac ggccacggca agatcctctt ccccgggtat 

901 cccgagggtt actacgagaa ccacttcctg cattccttcg agctggacga cggcaagatc 

961 aagcgcaacc gcgaattcat gaacgtcttc cagcaattgc gcgccctgag cattccggtc 

1021 ccgcagatca aacgcgaagg cattcccacc tgaggccatc ctggaagggg tgaactatgg 

1081 acgatctatt gcaacgcgta cggcgctgcg aagcgctgca gcaacccgaa tggggcgatc 

1141 cgtcgcgcct gcgcgacgtg caggcgtacc tgcgcggcag tccggcgctg atccgcgccg 

1201 gcgacatcct ggccctgcgc gcgaccctgg ccgggtcgcc cgcggcgagg cgctggtggt 
1261 gcagtgcggc gactgcgccg aggacatgga cgaccacca 



Fig. 16B 
3E8 phzA SEQ ID NO:116

1 MREYQRLKGF TDNLELRRRG SAVRVRRKRP AIHGHAGZRL RRLLERSFPP 
51 AGFPRCPFRH CAVLLLSGTL VQLSGRLANR PKNRTESMPH TCNFZGVMAI 
101 AKKRLZVCPL SNLVTTGFQA KHSVMAIRHZ LKLWRRSEAG NFCPEKVSRQ 
151 FFRPVIPMSS FQYGCQSIRT GGDSHHARVP TVERFTDNLE LRRRNRATVE 
201 HYMRMKGAER LQRHSLFVED GCAGNWTTES GEPLVFRGHE SLRRLAEWLE 
251 RCFPDWEWHN VRIFETEDPN HLWVECDGRG KALVPGYPQG YCENHYIHSF 
301 ELENGRIKRN REFTNPMQKL RALGIAVPQI KRDGIPTZLM IIPIQEEIZR 
351 CSIMLFPKVS KTPWSCAGRI ARRWSSIZTP KARIACAAMN FSSRTAVAVY 
401 GPPIPARPSS FVARTSWPST RCGRZNAFPD WEWYNIK 



Fig. 17 
3E8 phzB SEQ ID NO:117

1 MLDNAIPQGF EDAVELRRKN RETVVKYMNT KGQDRLRRHE LFVEDGCGGL 
51 WTTDTGSPIV IRGKDKLAEH AVWSLKCLPG LGVVQHQG 



Fig. 18A 



3E8 PHZA SEQ ID NO:161 

MREYQRLKGFTDNLELRRRNRATVEHYMRMKGAERLQRHSLFVE 

DGCAGNWTTESGEPLVFRGHESLRRLAEWLERCFPDWEWHNVRIFETEDPNHLWVECD 

GRGKALVPGYPQGYCENHYIHSFELENGRIKRNREFTNPMQKLRALGIAVPQIKRDGIPT 



Fig. 18B 



PhzB SEQ ID NO:162 



MLDNAIPQGFEDAVELRRKNRETVVKYMNTKGQDRLRRHELFVEDGCGGLWTTDTGSPIVIRGKDKLAEHAVWSLKCF 
PDWEWYNIKVFETDDPNHFWVECDGHGKILFPGYPEGYYENHFLHSFELDDGKIKRNREFMNVFQQLRALSIPVPQIK 
REGIPT 



Fig. 18C 



PhzC SEQ ID NO:163 

HDDLLQRVRRCEALQQPEWGDPSRLRDVQAYLRGSPALIRAGDILALRATLAGSPAARRWWCSAATAPRTNTTT 



Fig. 18D 



PA14 phzR SEQ ID NO:164
phzR DNA sequence: 1161 bp

CGTCGACGAGGCCCGCCATGGGCCAAGGTTTGTTGTCGGGAGGCgCTCCCGACGACGATG 
GAGCGTGCGAGAAGAACAATGAGAAAGACCGCCGTGAGGCCCATCGGAGAGCCGTTCTAC 
GGTTTCCGCAAAGATCCGGGGCGCCGTCCCCTCCAGCaCAGCGCAGTTCCTGCGCGGCGC 
CTCGTGTCCGTGCTCATCGAGAAGTTCTCTTCAGCCTCGTTTCGTCGTCGCCCGGCGGGC 
GGCGAATGGGCTCGACCTCGTCCGGAACACCCGCACAGGGCCGGTGGCGATATGTACTTC 
CAGGTCCGGCTTGATAAAGGGAATTGTCATGAGTGGATAAGACGGAAACAAAAAAGAATA 
AAAACGCTGAAGAACCGAATCCTGCCGGGATCGATTGTTGACTGGTGAAGCTGGCATGCA 
TGATGAGAGAGAGGGATATCTCGAGATTTTGTCAAGAATAACAACCGAGGAAGAGTTCTT 
CTCCCTGGTTCTCGAGATATGCGGTAATTATGGATTCGAATTCTTTTCATTCGGTGCGCG 
GGCGCCTTTCCCGCTGACCGCGCCTAAATATCATTTCCTGTCCAATTACCCAGGGGAATG 
GAAAAGCAGATATATCTCCGAAGACTACACATCCATCGACCCGATCGTGCGCCATGGTCT 
CCTGGAATACACCCCGCTGATCTGGAATGGCGAAGACTTCCAGGAGAACCGTTTCTTCTG 
GGAGGAAGCGCTGCATCACGGCATCCGTCACGGCTGGTCGATCCCGGTCCGCGGCAAGTA 
CGGGCTGATCAGCATGCTGTCCCTGGTGCGTTCCAGCGAGAGCATCGCCGCTACGGAAAT 
CCTGGAGAAGGAATCCTTCCTGCTCTGGATCACCAGCATGCTGCAGGCTACCTTCGGCGA 
CCTGCTGGCGCCGCGCATCGTCCCGGAAAGCAATGTGCGCCTGACCGCCAGGGAAACCGA 
GATGCTCAAGTGGACCGCGGTGGGCAAGACCTACGGCGAGATCGGCCTGATCCTGTCGAT 
CGACCAGCGCACGGTGAAATTCCATATCGTCAATGCGATGCGCAAGCTCAACTCCAGCAA 
CAAGGCGGAGGCCACCATGAAGGCCTACGCCATCGGCCTGCTCAACTGAATCGACGCCTC 
GTCGCCTAGCGAGGCCGCCGC 

Fig. 18E 



PA14 PhzR SEQ ID NO:165 
PhzR peptide sequence 

MHDEREGYLEILSRITTEEEFFSLVLEICGNYGFEFFSFGARAPFPLTAPKYHFLSNYPG 
EWKSRYISEDYTSIDPIVRHGLLEYTPLIWNGEDFQENRFFWEEALHHGIRHGWSIPVRG 
KYGLISMLSLVRSSESIAATEILEKESFLLWITSMLQATFGDLLAPRIVPESNVRLTARE 
TEMLKWTAVGKTYGEIGLILSIDQRTVKFHIVNAMRKLNSSNKAEATMKAYAIGLLNZ 

Fig. 18F 



34H4 SEQ ID NO:118

1 ACCAACATCC TGGTCCTGAG CAACAGCCAG CGCCACGGCC TGGCCGCCGC 

51 CTGGCCGATC GTGCTCGGCG CCTGCGCGGC GGTGGCGGCG CTGATCCTGC 

101 TGCTCGGGCT CGGCCTGGGC GAGCTGCTGC GGCGCCACCC GTTGCTCCAG 

151 CAGGGGCTCG CCTGGCTTGG CGTCGGCTGG CTCAGCTACC TGGCCTGGAG 

201 CCTGTTCCGC AGCGCG 



Fig. 19 

33C7 SEQ ID NO:119 



1 CCACCGAAGT AACGGGTCAG CTCGTCGCAC AACAGGCGTC GCTCCTCGGC 
51 CTGCATCAGG CTGCCCAGCG GGCCCTGGAA CCAGTCGCGC GCGCCCGGTT 
101 GAT 



Fig. 20 

25a12.3 SEQ ID NO:120

1 GCGGTGCCCT GGATGTCGTC GTTGAAGCAG CACAGCTCGT CCTTGTAGCG 
51 CTCCAGCAAC GGCATGGCAT TGGTCTGGGC GAAGTCCTCG AATTGCAGCA 
101 GGACCTTGGG CCACGGCGCT TGATCGCCTG GATGAACAGG TCGACAA 



Fig. 21 



8C12 SEQ ID NO:121



1 TATTTGTGTA TAAGNCTCAG GcTCtGGAGG GGCCGCTGGG CAGGCNNAAC 
51 NNCCTCGCGT NCTNGGCGAC GANTTNCNNA TGCTTCGCNT GCTGCCGGCG 
101 TCTCNNCCCT CNGTACTAgT CTACGCGTGG ACAACGTGGC 



Fig. 22



2A8 SEQ ID NO:122

1 NATTTGTGTA TAAGAGTCAG GATCGAACGC TTCTCTTCGC CGCAGGAAAG
51 CCACCGCCGA GCTGCTGAAG ATGCTCGAGC GCAAGGGACA AGATCATGGG
101 CTTCGGGCAT NCCNTCTNNA TCGATTCCTN CCCACGCAAC GAAgTGATCA
151 AGGGTTGGTC GAAGCAGCTC GCCGACgAGG TCGGCGACAA GGTCCTGTTC
201 GCGGTTTCCG AGGCCATCGA CAAGACCATG TGGGAGCAGA AGAACTGTTC
251 CCCAACGCCG ACTTCTACCA CGCCTCGGCG TCNCCCNTCC NGTGCTTCCA
301 CCTT

Fig. 23

41A5 SEQ ID NO:123
1 tcgttgtaca ggccgaacag gccgagctgc caggtgtcgc cctcg 

Fig. 24A 

50E12 SEQ ID NO: 124 
1 gagcagacct gggtacccat ggcttccttg acccgctgca cgatgatgcc cagcgccgcc 
61 ttcagatcct tggcggagtt ctcttcctgg acgatcttgc gcagcgtgtt gagcatgctc 
121 ggggccttgt ctccgtgttc agtcccgcgc cagaaggcgc ggggccagtt ccttcagggc 
181 gcggcggtag acctcgcgct tgaaggtcac cacctgtccc agggggtacc agtaactcac 
241 ccagcgccag ccgtcgaact cgggcttgct ggtgatatcc atgcgcacgc gcgcctcgtc 
301 ggacatcagc cgcagcagga accatttctg cttctggccg atgcacagcg gctggctgtg 
361 ggtccgcacc aggcgctgcg gcaaacggta gcgcagccag ccgcgg 

Fig. 24B 



35A9 SEQ ID NO: 125 
1 cgcgacagta gcatataatc aatcatgagt gattaattaa ttggcgtttc tgtaacatat 
61 ccttatgatc tgcggcgcct ttcccttgtg aggacgttca gtggccagga aaaccaaaga 
121 ggaatcccag aaaacccgcg acggcatact cgatgccgcc gagcgggttt tcctggaaaa 
181 gggcgtgggc accactgcca 

Fig. 24C



pho23 SEQ ID NO: 126 
1 tcgatcccaa tgactacaag gacgaaatcc gccagatcgc ccgcgacaag gccaacctgg 
61 agctggacct gaagggcgac atcggctgga gcctgttccc ctggctgggc ctggagc 

Fig. 24D 

6G12 SEQ ID NO: 127
1 ggataggtgc ggcggaaaac gtacgggacg aaagagcggt tttcccgaat gacgcatcct 
61 cctgcaagcg caacttgctg gtggtcgata gcaagtaagg cgcgagacat gtcctgaact 
121 tcatgggggc tttttcttat agggcggact gtcgattctg ctagctggta atccttcttt 
181 tattgtctct gtgtgcgctt tttgtatgga tgtgtcgaat attttgaata tcgccgttca 
241 actttatcca gggccgcagt tcagtgattt attttctcga aaagtttgtt ttttccaata 
301 ttcatgcttc atagtctggc cggcc 

Fig. 24E 

25F1 SEQ ID NO: 128 
1 gcaggaaacc gttctccana tcctgggcga gaatcctcgg cacatgcacg ccggctccgg 
61 cgagcagtcc ggcgaccttg acgaacggtc ggcagtcttc ctggggcggc ggcgcgtcca 
121 tcaccaccag gctgcggtcc cctccctgcc agcggaaata acgacggaag ctggcgtcgc 
181 tactggccgg gatcagttcg gcggggggca cttcccccca accttcggca acgaacaact 
241 cgggcaaaca agagtccaac cagcaattca gctgctggaa acgggcatca tcagacattt 
301 acggggttct ccacggccct agccgttgcg caggtcatgc tttattatcc agcatctttt 

Fig. 24F 
Sequences of PA14 50E12 encoding for YgdPPa and PtsPPa

1/1 31/11 61/21 91/31 

GAA AAG GGC CAG ACG CAC GGG GT<3 ACT CCA TOG GTT GGC GGG TGG CGG gAG 6GC CGC GAG AGC CTT TTG CGA A<3G CTC CCA CCG G(?C CTT GGG AAA aCC CCT AGC CTA CCG GCT TTT GCC 

121/41 151/51 181/61 211/71 

GGC CCT GTA TCC TCC CCG CAC GAG TCG CAA AGC CGC GCG HG CCG CTA TCA OA GCT TTA TGG AAC AAT GCG GGC ACA TGC GAT TTC GAG GAT GTC CCA GCG TGA TOG ATT CCG ATG GTT 

H I D S D G F 

241/81 271/91 301/101 331/111 

TTC GCC CGA ATG TCG GCA TCA TTC TCG CCA ACG AGG CGG GGC AGG TGC TOT GGG CGC GGC GTA TCA ATC AGG AAG CCT GGC AGT TCC CGC AGG GAG GCA TCA ATG ATC GCG AAA CGC CGG 
R P S V G I I L A K E A G Q V L W A R R I H 5 E A H Q F P Q G G I H D I E T P E 

361/121 391/131 421/141 451/151 

AAG AGG CGC TGT ATC GCG AaT TGA ACG AAG AAG TCG GGC TGG AGG CCG GGG ACG TGC GCA TCC TGG CCT GCA CCC GCG GCT GGc TGC GCT ACC GTT TGC CGC AGC GCC TGG TGC GGA CCC 
S A L I S E L H E E V G 1 E A G D V I I L A C T S G » L R Y 11 L P 0 * I V I 1 £ 

481/161 511/171 541/181 571/191 

ACA GCC AGC CGC TGT GCA TCG GCC AGA AGC AGA AAT GGT TCC TGC TGC GGC TGA TGT CCG ACG AGG CGC GCG TGC GCA TGG ATA TCA CCA GCA AGC CCG AGT TCG ACG GcT GGC GCT GGG 
SQPLCIGQKQKHFLLRLHSDEARVRMDITSKPEFDGHRWV 

601/201 631/211 661/221 691/231 

TGA GTT ACT GGT ACC CCC TGG GAC AGG TGG TGA CCT TCA AGC GCG AGG TCT ACC GCC GCG CCC TGA AGg AAC TGG CcC CGc GCC TTC TGG CGC GGG ACT GAA CAC GGA GAC AAG GCC CCG 
SYNYPLGQVVTFIREVYSRALKELAPRLLARD* 

721/241 751/251 781/261 811/271 

AGC ATG CTC AAC ACG CTG CGC AAG ATC GTC CAG GAA GTG AAC TCC GCC AAG GAT CTG AAG GCG GCG CIG GGC ATC ATC GTG CAG CGG CTC AAG GAA GCC ATG GGT ACC CAG GTC TGC TCG 
H L H T L R I I V 8 E V H S A I D L I A A L G I I V Q R V I E A H G T Q V C S 

841/281 871/291 901/301 931/311 

GTG TAC CTG CTC GAC ACC GAG ACC CAG CGT TTC GTC CFG ATG GCC ACC GAA GGC CTC AAC AAG CGT TCC ATC GGC AAG GTC AGC ATG GCc CCC AGC GAA GGC CTG GTC GGC CTG GTC GGC 

V Y L L D T E T Q R P V L M A T E G L H I R S I G I V S X A P S E G L V G L V G 

961/321 991/331 1021/341 1051/351 

ACC CGC GAG GAG CCG CTC AAC CTG GAG AAC GCC GCC GCC CAC CCG CGC TAC CGC TAT TTC GCC GAG ACC GGC GAG GAG CGC TAC GCG TCG TTC CTC GGC GCG CCG ATC ATC CAC CAT aGG 

T R E E P L N L E H A A A H P R Y R Y F A E T G E E R Y A S F L G A P I I H H R 

1081/361 1111/371 1141/381 1171/391 

CGG GTG ATG GGG GTG CTG GTG GTG CAG CAG AAG GAG CGC CGC CAG TTC GAC GAA GGC GAG GAg GCC TTC CTC GTC ACC ATG AGC GOC CAG CTC GCC GGG GTC ATC GOG CAT GCC GAG GCG 

RVHGVLVVQQKERRQFDEGEEAFLVTHSAQLAGVIAHAEA 

1201/401 1231/411 1261/421 1291/431 

ACC GGT TCG ATC CGC GGC CTG GGC AAG CTC GGC AAG GGC ATC CAG GAA GCC AAG TTC GTC GGC GTG CCC GGC GCC CCC GGG GTC GGG GTG GGC AAG GCG GTG GTG GTG TTG CCt CCG GCC 

T G S I R G I> G I L G E G I Q E A I F V G V P G A P G V G V G X A V V V L P P A 

1321/441 1351/451 1381/461 1411/471 

GAC CTG GAA GTG GTG CCG GAC AAG CAG GTC GAC GAC ATC GAC GCC GAG ATC GCC CTG TTC AAG CAG GCC CTG GAG GGC GTT CGC GCC GAC ATG CGC GOG CTG TOG AGC AAG CTC GCC AGC 

DLEVVPDKQVDDIDAEIALFKQALSGVRADHRAISSKLAS 

1441/481 1471/491 1501/501 1531/511 

CAG tTG CGC AAG GAA GAA CGC GCG CTG TTC GAC GTC TAC CTG ATG ATG CTC GAC GAT GCC TCC ATC GGC AAC GAG GTC AAG CGC ATC ATC CGT ACC GGC CAG TGG GCC CAG GGC GCC CTG 

{ I R I E E R A L F D V Y L X H L D D A S I G N E V I R I I R T G Q K A J G A L 

1561/521 1591/531 1621/541 1651/551 

CGC CAG GTG GTG ATG GAG CAC GTG CAG CGC TTC GAG CTG ATG GAC GAC GCC TAT CTC CGC GAG CGC GCC TCC GAC GTC AAG GAC ATc GGT CGC CGC CTG CTC GCC TAC CTg CAG GAA GAa 

RQVVHEHVQRFEItMDDAYLRERASDVRDIGRRLLAYIiQEE 

1681/561 1711/571 1741/581 1771/591 

CGC AAG CAG AAC CTG ACC TAC CCG GAg CAG ACC ATC ATC GTC AGC GAG GAG CTG TCG CCG GCG ATG CTC GGC GAG GTG CCG GAA GGG CGC CTG GTC GGC CTG GTC TCG GTG CTC GGC TCG 

I I 5 K L T Y P E Q T I I V S E E L S P A K L G E V P E G R L V G L V S V L G S 

1801/601 1831/611 1861/621 1891/631 

GGC AAC TCG CAC GTG GCG ATC CTC GCC CGT GCC ATG GGC ATC CCC ACG GTG ATG GGG GCG GTC GAC CTG CCG TAC TCC AAG GTC GAC GGt ATC GAC CTG ATC GTC GAT GGC TAC CAC GGC 

G H S H V A I L A R A H G I P T V K G A V D L P Y S I V D G I D L I V D G Y 1 G 
1921/641 1951/651 1981/661 2011/671 

GAG GTC TAC ACC AAC CCC TCC GCC GAG OG GTG CGC CAG TAC AGO GAC GTG GTC GCC GAG GAG CGC GAG CTG AGC AAG GGC CTG GCG GCC CTG CGC GAG CTG CCC TGC GAG ACC CTC GAC 

E V Y T H P S A E L V £ { Y S D V V A E E I E L S I G L A A L R E L P C E T L D 

2041/681 2071/691 2101/701 2131/711 

GGC CAC CGC ATG CC6 CTC TGG GTC AAC ACC GGC CTG CTC GCC GAT GTC GCC CGC GCC CAG GAG CGT GGC GCC GAG GGC GTG GGC CTG TAC CGC ACC GAA GTG CC6 TTC ATG ATC AAC GAC 

G H R X P L R V R T G L L A D V A R A 5 E K G A E G V G L Y R T E V P F X I H D 

2161/721 2191/731 2221/741 2251/751 

CGC TTC CCC AGC GAG AAG GAA CAG cTG GCG ATC TAC CGC GAG CAG CTC AG! GCC TTC CAC CCG CTG CCG GTG ACC ATG CGC ACC CTG GAT ATC GGC GGC GAC AAG GCG CTG TCC TAC TTC 

R F P S I I B Q I A I Y R E Q L S A P R P L P V T M R T L D I G G D I A L S Y F 

2281/761 2311/771 2341/781 2371/791 

CCG ATC AAG GAA GAC AAC CCG TTC CTC GGc TGG CGC GGC ATC CGC GTC ACC CTC GAC CAC CCG GAG ATC TTC CTG GTC CAG ACC CGC GCC ATG CTC AAG GCC AGC GAA GGA CTG GAC AAC 

PIKEDHPFLGHRGIRVTLDHPEIFLVQTRAXLKASEGIiDN 

2401/801 2431/811 2461/821 2491/831 

CTG CGC ATC CTG CTG CCG ATG ATC TCC GGC ACC CAC GAG CTG GAA GAG GCC CTG CAC CTG ATC CAC CGC GCC TGG GGC GAG GTG CGC GAC GAG GGC GTC GAC ATC GCC ATG CCG CCt ATC 

L R I L L P X I S G T H E L E E A L H L I H R A W G E V R D E G V D I A X P P I 

2521/841 2551/851 2581/861 2611/871 

G<K ATG ATG GTC GAG ATT CCC GCC GCC ffTG TAC CAG ACC CGC GAG CTG GCC CGt CAG GTC GAC TTC CTT TCG GTC GGT TCG AAC GAC CTG ACC CAG TAC CTG CTG GOG GTC GAC CGC AAC 

GHHVEIPAAVYQTRELARQVDFLSVGSHDLTQYLLAVDRK 

2641/881 2671/891 2701/901 2731/911 

AAT CCG CGS GTC GCC GAC CTC TAC GAC TAC CTG CAT CCG GCC GTg CTG CAT GCG TTG AAG AAG GTG GTC GAC GAT GCC CAC CK GAA GGC AAG CCG GTG AGC ATC TGC GGC GAG ATG GCC 

R P R V A D L Y D Y L H P A V L H A L I I V V D D A H L E G I P V S ' I C G R X A 

2761/921 2791/931 2821/941 2851/951 

GGC GAT CCC GCG GCT GCC GTG OT CTG ATG GCG ATG GGC TTC GAC AGC CTG TCG ATG AAC GCC ACC AAC CTG CCC AAG GTG AAG TGG CTG CTG CGC CAG ATC ACC CTG GAC AAG GCC CGG 

G D P A A A V L L K A X G F D S L S X R A T 8 L P I V I V L L R { I T L D I A R 

2881/961 2911/971 2941/981 2971/991 

GAC CTG CTC GGC CAG TTG CTC ACC TTC GAC AAC CCG CAG GTC ATC CAC AGC TCG CTG CAC CTG GCG TTG CGC AAC CTC GGC CTG GGT CGC GTG ATC AAC CCG GCG GCT ACC GTC CAG CCC 

DLIGQLLTFDNPQVIHSSLHLALRNLGLGRVIHPAATVQP 

3001/1001 
TGATTTTCCC 
Sequence of PA14 35A9 encoding mtrRPa

1/1 31/11 61/21 91/31 

GTC GAT TT6 GAA CAG CAC GGT GCC GGC GCG GAC TgC CTG 6CC TTC CTC GTA CAG GCG ACG GOT GAC GAT GCC GGC GAC GCG CGC CC6 CgC cTC gGC CTG GCG 6TA CGC TTC CA6 6CG TCC 
121/41 151/51 181/61 211/71 

GGG CAG CTC GCT GGT GAT GCC GAT gGG C(5C CGG CCT GGC GAC GAT CAC GCC GAC CTC GGC 6GG GGC CTC CGC AGT CTT CCC GGt GTC CGC TGC TTC TTC GCA GCC CAG CAG GAA TAG GGC 

241/81 271/91 301/101 331/111 

GAC CAG GGC CGC CAG CAG CCC GCG CAG CGA GCC GGT CCA TTG GAT GTG CAT GGG TGT CCC TCG ATT CGT GAA CTC GCG AGC TTG CCC GGG AAg GGG CAC CGC AAC TCA CGA GCG GCG CGA 

361/121 391/131 421/141 451/151 

CAG TAG CAT ATA ATC AAT CAT GAG TGA cTA ATT AAT TGG CGT TTC TGT AAC ATA TCC 1TA TGA TCT GCG GCG CCT TTC CCT TGT GAG GAC GTT CAG TGG CCA GGA AAA CCA AAG AGG AAT 

MARKTKEES 

481/161 511/171 541/181 571/191 

CCC AGA AAA CCC GCG AtG GCA TAC TCG ATG CCG CCG AGC GGG TTT TCC TGG AAA AGG GCG TGG GCA CCA CTG CCA TGG CCG ACC TGG CGG ACG CCG COG GGG TTT CTC GCG GTG CGG TCT 
QKTRDGILDAAERVFLEKGVGTTAMADLADAAGVSRGAVY 

601/201 631/211 661/221 691/231 

ACG GCC ACT ACA AGA ACA AGA TO AGG TCT GtC TGG CGA TGT 606 ACC GOG CCT TCG GCC AGA TOG AGG TAC CCG AtG AAA ACG CCA GGG TGC CGG CGC TGG AcA TCC TOC TGC GCG COG 
GHYKNKIEVCLAHCDRAFGQIEVPDENARVPALDILLRAG 

721/241 751/251 781/261 811/271 

GCA TGG GCT TTC TOC GCC AGT GCT GCG AaC COG GTT OGG TGC AGC G©3 TGC TOG AGA TCC TCT ACC K!A AGT GCG AAC GCA GCG ACG AGA ACG AGC C®? TGT TGC GCC GCC GOG AGC rGC 
HGFLRQCCEPGSVQRVLEILYLKCERSDEHEPLLRRREliIi 

841/281 871/291 901/301 931/311 

TCG AGA AGC AGG GGC AAC GCT TCG GCC gaC GGC AGA TCC GCC GGG CGG TGG AgC GCG GCG AAC TGC CGG CGC GGC TGG ACG TCG AGC TGG CCA GCA TCT ATC TGC AAT CGC TgT GGG ACG 
E -J 0 -6" fl— R"F G- R" R— Q I~R -R" A V E- R G E L h A R L D V -R- L A S I Y L Q S L H D G 

961/321 991/331 1021/341 1051/351 

GCA TCT GCG GCA CCC TGG CCT GGA CCG AGC GCt TGC GCG ACG ATC CCT GGA gCC GCG CCG AAC GCA TGT TCC GCG CCG GCC TCG AtA GCC TGC GCA GTT CfC CCT ACC TcT TGC TGG CGG 
I C G T L A H T E R L R D D P I S R A E R H F R A G L D S L R S S P Y L L L A D 

1081/361 1111/371 1141/381 1171/391 

ACG CCT GAG GGC GTC AAT CGT COG CCA TCA GGT GCC TGC GCT GGT CCT CGG CGC CGG CGA CCA CCA GCC GCT GGG CGT CcT CCT CGC TGA TGT GCA GGC GCT TGC CaT CGA TGT AGA GCA 
A * 

1201/401 1231/411 1261/421 1291/431 

CCG ACA GGC GCG CCT CGG CGT CGG TAC CGA TK GCA GGC TGT CGA CCG GCG CGC GAT GCC GGC TGC CTT CGA TCT CCA CGC TGC AGA tGC CTT GTT CCG AAT CGA TTT CGA TGG ACA TGG 

1321/441 1351/451 1381/461 1411/471 

GAa CCT CCc GTT TTc TCC GCC TAC CTT GGG TGG ACC CCG GGC ATC CGC GCG GOT TCT GTC ACG GTA GCT TCA CGC CAG CGT CAC GCG CCT GCC ACC GCG CTT GGC TGC AAT CGT CCG CAG 

1441/481 1471/491 1501/501 

AGA aGG CGA GGC CAG CGG AGG ACG ACG CCA TGC GGC TAT GCG TGA TTG GTG CGG GCT ATG TGG GAC TGG TGA 



Fig. 24J 




Sequences of PA14 25F1 encoding for orf! OrfU and DjIAPa 

1/1 31/11 61/21 91/31 

C6A GGA ATC CAG TCG AGG TGC 6Ag TAG TCC GCA CTG CGG 61T cTC iGC GC6 CGA CCa COG GBC TCG GTG ACC AGG CGC TGG GIC GGC TCT GCC TCG AOG 6TT TC6 CCI CCG CTC COG GAC 

121/41 151/51 181/61 211/71 

ACGCTGCTGCCCGCC(!CGGCGGTG(!TGACCGAGGTCGCGOT 

241/81 271/91 301/101 331/111 

CAG TCG AAC TGA TCG GCG GCC TGT ACA GTC AAT GCT GCG ACC GGC TGC AGA GCC AGC AGG CTG CCG GTG ACC AGC AGG GGA AAC TTT CTT CGA A&C ACG AGG GAT TTC ACT GCC ATC TTG 

361/121 391/131 421/141 451/151 

TTA ATC CGG GCT TCC TGC GCG CCA TCG GCC CGG TGG GCC GCA CGC CTC TCG ATG GGC TGA AAA AGA TGC TGG ATA ATA AAG CAT GAC CTG CGC AAC GGC TAG GGC CGT GGA GJA CCC CGI 

481/161 511/171 541/181 571/191 

AAA TGT CTG ATG ATG CCC GTT TCC AGC AGC TGA ATc GCT GGT TGG ACTCrTGTTTGCCCGAGTTGTTCGTTGCCG AAG GTT GGG GGG AAG TGC CCC CCG CCG AAC TGA TCC CGG CCA GTA 
H S D D A R P Q 0 L II R H L D S C L P ! L P V A E G I G E V P P A ! I I P A S S 

601/201 631/211 661/221 691/231 

GCG ACG CCA GCT TCC GTC GTT ATT TCC GCT GGC AGG GAG GGG ACC GCA GCC TGG TGG TGA TGG ACG CGC CGC CGC CCC AGG AAG ACT GCC GAC CGT TOG TCA AGG TCG CCG GAC TGC TCG 
D A S P R R Y P 11 H Q G G D R S L V V H D A P P P 8 E B C P. P P V I V A G I I A 

721/241 751/251 781/261 811/271 

CCG GAG CCG GCG TGC ATG TGC CGA GGA TTC TCG CCC AGG AtC TGGAGAACGGnTCCTGCTGCTCAGTGACCTGGGCCGGC AGA CCT ACC TCG ACG TGC TTC ATC CCG GaA ATG CCG ACG 
G j G V H V P 1 I L A 8 D L E H G F L L L S D L G R Q T Y L D V 1 1 P G H A D E 

841/281 871/291 901/301 931/311 

AGC TGT TCG AAC CGG CCC TGG ATG CGC TGA TCG CCT TCC AGA AGG TCG ATG TCG CCG GTG TCC TGC CTG CCT ACG ACG AAG CGG TGC TGC GCC GCG AGC TGC AGC TGT TCC CCG ACT GGT 
I F E P A L D A L I A F 8 J V D V A G V L P A I D E A V L R R E L 8 I P P D « ^ 

961/321 991/331 1021/341 1051/351 

ACC TGG CCC GCC ACC TCG GCG TGG AGC TGG AGG GCG AGA CGC TGG CCC GCT GGc AgC GGA TCT GCG ACC TGC TGG TAC GCA GCG CGC TGG AGC AAC CGC GGG TGT TCG TCC ATC GCG ACT 
L A R E I G V E L E G E T L A R H 6 R I C D L L V R S A L E 8 P R V P V H S D T 

1081/361 1111/371 1141/381 1171/391 

ATA TGC CGC GCA AcC TGA TGC TCA GCG AGC CCA ACC CGG GCG TCC TCG ACT TCC AGG ACG CCC TGC ACG GCC CGG TCA CCT ACG ATG TCA CCT GCC TGT ACA AGG AtG CCT TCG TCA GTT 
M p j H L K L S E P N P G V L D P 8 D A L H G P V T Y D V T C L I I D A P V S I 

1201/401 1231/411 1261/421 1291/431 

GGC CGG AGC CGC GCG TGC ATG CCG CGC TGA gtC GTT ACT GGA AGA AGG CGA CCT GGG CCG GCA TCC CGC TGC CGC CAA GCT TCG AgG ACT TCC TCC GcG CCA GCG ACC TGA TGG GCG TGC 
p i p H V H A A L S R Y H I I A T I A G I P L P P 8 P E D F L R A S D L H G V 8 

1321/441 1351/451 1381/461 1411/471 

AGC GCC ACC TGA AGG TGA TTG GCA TCT TCG CCC GTA TtT GTC ACC GCG ACG GCA AGC CGC GCT ACC TGG GTG ACG TGC CaC GCT TCT TCC GTT ATC TGG AAA CCG CCG TGG CGC GCC GTC 
R H i I V I G I P A R I C H R D G I P R Y L G D V P R F F R I L E T A V A R R P 

1441/481 1471/491 1501/501 1531/511 

CCG AGC TGG CCG AAC TGG GCG AGC TGC TGG CCT CGC TGC CGC AGG GAG CCG AGG CAT GAA GGC GAT GAT CCT CGC CGC CGG cCG TGG CGA GCG CAT GCG GCC GAC CAC CCT GCA CAC GCC 
M K A i I L A A G R G E R 11 R P I I L E T P E L A E L G E L L A S L P 8 A A E A 

1561/521 1591/531 1621/541 1651/551 

CAA GCC GCT GAT CGA GGC CGC CGG CGT GCC ATT GAT CGA GCG TCA GTT GCT GGC GCT GCG CCA GGC CGG AGT CGA CGA CTG GGT GAT CAA CCA TGC CTG GCT TGG CGA GCA GAT CGA GGC 
K p I I E A A G V P L I E R Q L I A L R Q A G V D D H V I R B A K L G E 8 I * * 
1681/561 1711/571 1741/581 1771/591 

CTA TCT CGG CGA CGG cTC GCG CCT GGG CGG GCG GAT CX CTA TTC aCC cGA GGG a6A ACC GCT GGA AAC CGG cGG t(3G AAT CTT OCG CGC CCT 6CC GTT GCT CGG CGA GCA GCC GTT CCT 
Y L G D G S S I G G 8 I A T S P E G 8 P L B T G G G I ? K A L P L L G S 8 P F L 

1801/601 1831/611 1861/621 1891/631 

GfTT GCT CAA OGG CGA TGT CTG GAG CGA CTT CGA CTA CTC TC6 GCT GCA TCT TGC CGA CGG CGA CCT GGC GCA TCT GGT GCT GGT CGA CAA CCC G6C GCA CCA TCC C6C CGG CGA TTT CCA 
L I R G 0 V W S D P D Y S 8 L H L A D G D L A H L V L V D H P A H fl P A G D F H 



1921/641 1951/651 1981/661 2011/671 

CCT GGA TGC CGG CGG ACG GGT GGG CGA GAC CCG CGA AGC GGG CGG CAA CCT gAC CTA CAG CGG GAT CGC CGT ACT GCA TCC CGC GCT GTT CGA GGG CTG OCA GCC GGG CGC CTT CAA GCT 
L D A G G R V G I T R E A G G H L T I S G I A V L H P A L F E G C 5 P G A F K L 

2041/681 2071/691 2101/701 2131/711 

GGC GCC GCT ATT GCG CAA GGC CAT CGC CGC GGG GCG GGT CAG CGG CGA ACA CtA TCG TGG GCA GTG GGT CGA CGT CGG TAC CCA CGA GCG CCT GGC GGA AGT CGA GOG ATT GCT GGC GGA 
A P L L R I A 1 A A G R V S G E H Y R G 5 V V D V G T H E R L A E V E R L L A E 

2161/721 2191/731 2221/741 2251/751 

GCA CGC CTG AGA TGC TCT GGC CCG CTA CGC TGA TCG GAG CCG GAG COG GCT GGG CCC TGG CCA GCA TCC CCG GCG CCC TGC TQ3 GCG GCC TGC GGC AAC TGC TGG ACC GCA GGT TGC 
HA* H L W P A T L I G A G A G V A L A S I P G A L L G G L L G 8 L L D R R L R 

2281/761 2311/771 2341/781 2371/791 

GCC TGG AGT CCT GGC GCG GCC TGC TGG CGC GCT TGC GCG GGC GGG CGG TGA ACG ATG AGG ACG ACC TGC TGT TcC AGT TGC TCG GCT ATC TGG CCA AGA GCG GCG GGC GGG TGG AGG AGA 
LESVRGLLARLRGRAVNDEDDLLFQLIiGYLAKSGGRVEEH 

2401/801 2431/811 2461/821 2491/831 

TGC ATA TCC GCC AGG CGC GCG AGG AGA TGG CGT TGC GCA AGC TCG ATA GGC GAG CCC AGC GGC GTG CCA TCG CGT CCT TCG GCA AGG GCA AGG CCG GCA TCG CCC ATC TGC AGG CGG AGG 
H I R 8 A R E E M A L I I L D R R A 8 R R A I A S F G K G I A G I A H L 8 A E V 

2521/841 2551/851 2581/861 2611/871 

TCG CGCGTC TGA AGG GCG AAC GTG CGG AGG CAG TAT TGC TCG CCT GCT GGC GGA TGG CCT GGG CTG GCG GCG TGC TCA GCC AGT CGG CGC GAC AAC TGG TGT TGC AAT GGG GGC GCT GK 
A R L I G E R A E A V L L A C « R K A S A G G V L S 8 S A R 8 L V 1 8 » 6 R V L 

2641/881 2671/891 2701/901 2731/911 

TGG GTT GGT CGG CGG AGC GAA CGG AAC GCT TGT CGG CGC GGG TCA TGC CGA AGC GGA CGC GCG CTG TCG CCC GGG ATA GCT ACC GTG AGG CCC TGC TGC TGC TCG GCG TGG AGG CCG GAA 
GWSAERTERLSARVKPKRTRAVARDSYREALLLLGVEAGS 

2761/921 2791/931 2821/941 2851/951 

GCG AGC CGG CGC TGA TCA AAC GCG CCT ATC GCA AGC TGA TCA GCC AGC ATC ATC CGG ACA AAC TGG CGG GAG CCG GCG CCA GCG TCG AGC GCG TGC GTG C©J CTA CCG AGA AAA CCC GTG 
E P A L I I R A Y R I L I S 8 H H P D I L A G A G A S V E I V R A A T E I T R E 

2881/961 2911/971 2941/981 2971/991 

AAT TGC AGG CGG CCT ACG CCC TGG TCC GAG AGC GTG AGG GGT TCC GCT GAT CAC TCC GCA GGT TTC TGC GCA TCG GCC TGC AGG TGA AGA CTG AGC CAG CO? CGG ATT CGT CGG TAC AGT 
L 8 A A Y A L V R E R E G F R * 

3001/1001 3031/1011 3061/1021 3091/1031 

TGC TCC TGC TCC GCC TTG GGG TCG GCC GGT AGa GCC TGC ATC GCG ATT TGT ACG TAG GCC GGG TGT TTC TGC CGC TTG CCG GCC TGC ATG CGC AGC CTG GCC GCC TCG CGG TCG GCG CG 
PhnA and PhnB SEQ ID NO: 129

1 CTGCAGCGTC TGCCGACCCT GCTGCAACTG ATCCCGGGAC ACGGCGGCCT
51 GCTGCGGGGG CGGCTGGCCG CGGATGGGGC CGAGTCGGCC TATACCGAGT
101 GTCTGCGCCT GTGCCGACGG TTGCTCTGGC GCCAGTCCAT GGGCGAGTCC
151 CTCGACGAAC TGAGCGAGGA GCTGCACCGC GCCTGGGGAG GGCAGAGCGT
201 CGACTTCCTG CCCGGCGAAC TGCACCTGGG GAGCATGCGC CGGATGCTGG
251 AGATTCTCTC CCGCCAGGCG CTGCCTCTGG ACTGAGGCGG AACATCCATT
301 GCGGCGATCG CGCCCGACGG CTGCGGTCGC AATTGGGGGA AATGGGGGTA
351 TCGATGATGA ATATGCCGTT GCGCGCTAGC GTCGCGCAGG CCAGTCGCCC
401 ATGGGCGCGG GGAGGTGGCT CGTGAGTGGG GTTGGCTATC GACTGGAAGA
451 AAGTCTGGAG TACCGCACGC TGGTGCCGGA GGCGCTGTCG ATCTGGCGCA
501 TGGCTGGCGC CAACCGGATG CTGTTCGACT GCTTCGACGT GGACAGCAAG
551 GCTGCGCGGC GTAGCGTGGC GATCCTTTCC AGCTGCCTGC GCATCGAGTG
601 CTGGGGGCGC GATGTGGTGC TGCGGGCGTT GAACTCCAAC GGACGCGCCT
651 TGCTGGCGCC GTTGAGCGAG GCCTGTCCGG CCCAGGTCAC CTGCTTGCGT
701 GACGGCGACA CCCTGCACTG GCGCTTCCCC CCGGAAGAGC CGCATGCGGA
751 CGAGTGGCGA CGCCTGCATG GCCTGTCCAG CCTGGAGGCG CTGCGCCGCG
801 TGCTCGGAAC GCTGGGCGAC GCGGAGGGGC CTGCGCTGCT GGGCGGCCTG
851 TTCAGTTTCG ACCTGGCCGA GCAGTTCGAA CCCTTGCCGG CGCCGGCCGA
901 ACCTGCGCGG CATTGCCCGG ACTACCTGTT CCTGGTGCCG GAGTTGCTGC
951 TGGATATCGA TCACCTGGCG CGCCGGACTT CGCTGCAAGC GTTCGTCCAC
1001 GATCCGGCCG GGCACGACCG GTTGGCCGCC AGCCTGCGCC AATGTGCCGA
1051 CGAATTCCAT GGCGCCGTGG AGGAGGCTTC CGAGTCGCCG GTGGCAGGCG
1101 TACGGGCCGG CAACTACCAG GTCGACCTGG ACGATGCGAG CTTTGCCCGC
1151 CAGGTAGAAC GCCTGCAGGC CCACGTGAGG GCCGGCGACG TGTTCCAGAT
1201 CGTACCTTCG CGCAGCTTCA GCATGCCGTG CGCGGACCCC TGGCGGGCCT
1251 ATCGCCAGTT GTGCCTGCGC AACCCCAGCC CGTACCGCTT CTTCCTCGAT
1301 GCGGGGGACT TCTGCCTGTT CGGCGCTTCG CCGGAGTCGG CATTGAAGTA
1351 CGACGCGGAG AGTCGCGAGG TGGAACTCTA TCCCATTGCC GGCACCCGCC
1401 CGCGCGGATG CGATGCCCGG GGCGCCATCG ATGCGGAACT GGACAATCGC
1451 CTGGAAGCGG AGTTGCGCCT GGATGCCAAG GAGATCGCCG AGCACATGAT
1501 GCTGGTCGAC CTGGCGCGCA ACGATCTGGC GCGCGTCTGC CGCAGCGGTA
1551 CCCGGCAGGT GCGCGACATG CTCAAGGTCG ATCGCTACAG CCACGTGATG
1601 CACCTGGTCT CGCGCGTGGC TGGCGAACTG CACGGCGAAC TGGATGCGCT
1651 GCATGCCTAC CGTGCCTGCC TGAACATGGG CACCCTGGTC GGCGCGCCGA
1701 AGGTCCGTGC CATGCAGTTG CTGCGGCAGT ACGAGGATGG CTATCGCGGC
1751 AGCTACGGTG GTGCGATCGG CATTCTCGAC AGCGCCGGCA ACCTCGATAC
1801 CAGCATTGTC ATCCGCTCCG CCGAGGTCCG CGAAGGTATC GCGCGGGTTC
1851 GGGCAGGCGC CGGCGTGGTG CTGGATTCGG ATCCACGGCT GGAGGCCGAG
1901 GAAACCCGCA ACAAGGCGCT GGCGGTGCTG ACCGCCGTGG CCGCTGCCGA
1951 ACGCGAAAGG GGAGAGCGCG ATGCGCATCA CGCTGTTGGA TAACTTCGAT
2001 TCCTTCACCT ACAACCTGGT CGAGCAGTTC TGCCTGCTCG GCGCGGAGGT
2051 CCGGGTGATG CGCAACGATA CGCCGTTGCC GACGATCCAG GCGGCATTGC
2101 TGGCCGACGG TTGCGAACTG CTGGTGCTGT CGCCGGGGCC CGGTCGGCCG
2151 GAAGACGCCG GCTGTATGCT GGAATTGCTC GCCTGGGCCC GCGGGCGCTT 
2201 GCCGGTGCTC GGCGTCTGCC TCGGCCACCA GGCGCTGGCG CTGGCCGCCG 
2251 GTGGCGCGGT GGGCGAGGCG AGGAAGCCGC TGCATGGCAA GAGCACGTCC 
2301 CTGCGTTTCG ATCAGCGTCA CCCGCTGTTC GACGGCATCG CTGACCTGCG 
2351 CGTCGCGCGC TACCACTCGC TGGTGGTCAG TCGCCTGCCG GAAGGTTTCG 
2401 ACTGCCTGGC CGATGCCGAT GGCGAGATCA TGGCGATGGC CGATCCGCGC 
2451 AATCGACAGC TGGGCTTGCA ATTCCATCCC GAGTCGATTC TCACCACCCA 
2501 CGGCCAGCGT CTGCTGGAGA ACGCTCTACT CTGGTGCGGC GCGTTGGCGG 
2551 TCGCGGAGCG CCTTCGGGCC TGAGCGGCGC TGCGCAGTTT CGACCGAGGC 
2601 TCGGTTGCCA GGCCGGCGCA TCGTCGAAAC GCTGGCGGCC CAGTTCGCGC 
2651 AGGCGCTGGC GGGCGCTTTC GAGAAAGCGA CGGAAGCTGC GCTCGGATTC 
2701 CAGCGCGGTG TTGTAGTAGC AATACACCTT GGTGTCGATG CCGCCCGGTT 
2751 CGTACAGTTC GCTGAGGACT GCCAGGGTAC CGTTGCGCAG GCGTTCCTcG 
2801 ACGAAATAAT GCGGCGaGAT GCCCCATCCG ACGCCGGCTT CCACCAGACG 
2851 CAGCATGTCG TCGAAGTTTT CCACGAAGAG CACCTTGTCG CTGACCGGCC 
2901 GCAGCAGGTT CGAATGCTGC CCGGAGCGGC TgCCGAGGCT GATCTGCCGG 
2951 TAATTGGCCA GGCTCGCGAT GCTGTGCAGG GAGGCATTGC ACAACGGGTG 
3001 CTGCGGATGG GCGACGACGA ACGCCTTGGT GTAGCCGAGC ACGCACTGGT 
3051 TGAAGCGGGA GATCT 
PhnA protein SEQ ID NO:130 

1 MGARRWLVSG VGYRLEESLE YRTLVPEALS IWRMAGANRM LFDCFDVDSK 

51 AARRSVAILS SCLRIECWGR DWLRALNSN GRALLAPLSE DCPAQVTCLR 

101 DGDTLHWRFP QEESHADEWR RLHGLSSLEA LRRVLGTLGD AEGPVLLGGL 

151 FSFDLAEQFE PLPAPAEPAR HCPDYLFLVP ELLLDIDHLA RRTSLQAFVH 

201 DPAGHDRLAA SLRQCADEFH GAVEEASESP VAGVRAGNYQ VDLDDASFAR 

251 QVERLQAHVR AGDVFQIVPS RSFSMPCADP WRAYRQLCLR NPSPYRFFLD 

301 AGDFCLFGAS PESALKYDAE SREVELYPIA GTRPRGRDAR GAIDAELDNR 

351 LEAELRLDAK EIAEHMMLVD LARNDLARVC RSGTRQVRDM LKVDRYSHVM 

401 HLVSRVAGEL HGELDALHAY RACLNMGTLV GAPKVRAMQL LRQYEDGYRG 

451 SYGGAIGILD SAGNLDTSIV IRSAEVREGI ARVRAGAGVV LDSDPRLEAE 

501 ETRNKALAVL TAVAAAERER GERDAHHAVG 

Fig. 26 

PA14 degP SEQ ID NO:131 

1 CGTCCGATTC GGCCTGAGTC TTTCTCTTCC CTCGAACATC ACGGGAGCTG TAGTCGATGC 
61 ATACCCTAAA ACGCTGTATG GCTGCGATGG TGGCCTTGCT GGCCTTGAGC CTGGCGATGA 
121 CGGCCCGGGC AGAACTGCCG GACTTCACGC CTTTGGTCGA ACAGGCGTCG CCGGCGGTGG 
181 TGAATATCAG TACGCGGCAG AAGCTGCCGG ATCGCGCCAT GGCGCGCGGG CAGCTGTCGA 
241 TCCCCGACCT CGAAGGGCTG CCGCCGATGT TCCGCGACTT CCTCGAGCGC ACGATCCCGC 
301 AGGTTCCGCG CAATCCGCGC GGCCAGCAGC GCGAGGCGCA ATCGCTGGGC TCCGGCTTCA 
361 TCATCTCCAA CGACGGCTAC ATCCTCACCA ACAATCACGT CGTGGCCGAT GCCGACGAGA 
421 TCCTGGTGCG CCTGTCCGAC CGTAGCGAGC ACAAGGCCAA GCTGGTCGGC GCGGACCCGC 
481 GCAGCGACGT GGCGGTGCTG AAGATCGAGG CGAAGAACCT GCCGACCCTG AAACTGGGCG 
541 ATTCGAACAA GCTGAAAGTG GGCGAATGGG TCCTGGCCAT CGGTTCGCCG TTCGGCTTCG 
601 ATCACTCGGT CACCGCCGGT ATCGTCAGTG CCAAGGGGCG TAGCCTGCCG AACGAGAGCT 
661 ACGTACCCTT CATCCAGACC GACGTGGCGA TCAACCCGGG CAACTCCGGC GGTCCGCTGC 
721 TGAACCTGGA GGGCGAAGTG GTCGGCATCA ACTCGCAGAT CTTCACCCGT TCCGGCGGCT 
781 TCATGGGCCT GTCCTTCGCC ATCCCGATCG ATGTCGCGCT GAACGTCGCC GACCAGTTGA 
841 AGAAAGCCGG CAAGGTCAGC CGCGGCTGGC TGGGTGTGGT GATCCAGGAA GTGAACAAGG 
901 ATCTCGCCGA GTCCTTCGGC CTCGACAAGC CGTCCGGCGC GCTGGTGGCG CAGCTGGTGG 
961 AAGACGGTCC GGCGGCCAAG GGCGGCCTGC AGGTGGGCGA TGTGATCCTC AGCCTGAACG 
1021 GCCAGTCGAT CAACGAGTCC GCCGACCTGC CGCACCTGGT GGGCAACATG AAGCCGGGCG 
1081 ACAAGATCAA CCTGGACGTG ATTCGCAACG GCCAGCGCAA GTCCTTGAGC ATGGCGGTAG 
1141 GCAACCTTCC GGACGACGAC GAGGAAATCG CCTCGATGGG CGCTCCGGGC GCCGAGCGCA 
1201 GCAGCAACCG CCTGGGCGTG ACCGTCGCCG ACCTGACCGC CGAGCAGCGC AAGAGCCTGG 
1261 ATATCCAGGG CGGCGTGGTG ATCAAGGAAG TCCAGGACGG TCCGGCCGCG GTCATCGGCC 
1321 TGCGTCCGGG CGATGTCATC ACCCACCTGG ACAACAAGGC GGTGACCTCG ACCAAGATCT 
1381 TCGCCGACGT GGCCAAGGCC CTGCCGAAGA ACCGTTCGGT TTCGATGCGG GTACTG 
PA14 degP protein SEQ ID NO: 132 

1 MHTLKRCMAA MVALLALSLA MTARAELPDF TPLVEQASPA VVNISTRQKL 
51 PDRAMARGQL SIPDLEGLPP MFRDFLERTI PQVPRNPRGQ QREAQSLGSG 
101 FIISNDGYIL TNNHVVADAD EILVRLSDRS EHKAKLVGAD PRSDVAVLKI 
151 EAKNLPTLKL GDSNKLKVGE WVLAIGSPFG FDHSVTAGIV SAKGRSLPNE 
201 SYVPFIQTDV AINPGNSGGP LLNLEGEVVG INSQIFTRSG GFMGLSFAIP 
251 IDVALNVADQ LKKAGKVSRG WLGVVIQEVN KDLAESFGLD KPSGALVAQL 
301 VEDGPAAKGG LQVGDVILSL NGQSINESAD LPHLVGNMKP GDKINLDVIR 
351 NGQRKSLSMA VGNLPDDDEE IASMGAPGAE RSSNRLGVTV ADLTAEQRKS 
401 LDIQGGVVIK EVQDGPAAVI GLRPGDVITH LDNKAVTSTK IFADVAKALP 
451 KNRSVSMRVL 

Fig. 28 

PA 8830 algD SEQ ID NO:133 

1 GCGCGACAAA CAATCGAGGT GAATGCGATG CGAATCAGCA TCTTTGGTTT 
51 GGGCTATGTC GGTGCAGTAT GTGCTGGCTG CCTGTCGGCA CGCGGTCATG 
101 AAGTCATTGG TGTGGATGTC TCCAGCACCA AGATCGACCT GATCAACCAG 
151 GGCAAGTCGC CCATCGTCGA ACCGGGCCTG GAAGCGTTGT TGCAGCAAGG 
201 CCGGCAGACC GGACGGCTGT CGGGCACCAC CGACTTCAAG AAGGCTGTGC 
251 TGGACTCCGA CGTATCGTTC ATCTGCGTCG GCACGCCGAG CAAGAAGAAC 
301 GGCGACCTGG ACCTGGGCTA CATCGAGACC GTCTGCCGCG AGATCGGCTT 
351 CGCCATCCGC GAGAAGTCCG AACGCCACAC CGTGGTGGTG CGCAGCACCG 
401 TACTGCCGGG CACCGTCAAC AACGTGGTGA TCCCGCTGAT CGAGGACTGC 
451 TCGGGCAAGA AGGCCGGGGT CGACTTCGGC GTCGGCACCA ACCCCGAATT 
501 CCTCCGCGAG AGCACCGCGA TCAAGGACTA CGACTTCCCG CCGATGACCG 
551 TGATCGGCGA ACTGGACAAG CAGACCGGCG ACCTTCTCGA GGAAATCTAC 
601 CGCGAGCTGG ACGCGCCGAT CATCCGCAAG ACCGTCGAGG TCGCCGAGAT 
651 GATCAAGTAC ACCTGCAACG TCTGGCACGC CGCCAAGGTC ACCTTCGCCA 
701 ACGAGATCGG CAACATCGCC AAGGCGGTCG GCGTCGACGG CCGCGAGGTG 
751 ATGGACGTGA TCTGCCAGGA CCACAAGCTC AACCTGTCGC GCTACTACAT 
801 GCGTCCCGGC TTCGCCTTCG GCGGCTCCTG CCTGCCCAAG GATGTACGCG 
851 CCCTCACCTA TCGCGCCAGC CAGCTGGACG TCGAGCACCC GATGCTCGGT 
901 TCGTTGATGC GCAGCAACTC CAACCAGGTG CAGAAGGCCT TCGATCTCAT 
951 CACCAGCCAC GACACCCGCA AGGTCGGCCT GCTCGGCCTG TCGTTCAAGG 
1001 CCGGCACCGA CGATTTGCGC GAAAGCCCGC TGGTGGAGCT GGCCGAGATG 
1051 CTCATCGGCA AGGGCTACGA GTTCCGCATC TTCGACCGCA ACGTCGAATA 
1101 CGCGCGTGTC CACGGGGCCA ACAAGGAATA CATCGAGTCG AAGATCCCGC 
1151 ACGTCTCCTC GCTGCTGGTC TCCGACCTCG ACGAAGTGGT GGCGAGTTCC 
1201 GATGTGCTGG TGCTGGGCAA TGGCGACGAG CTGTTCGTCG ACCTGGTGAA 
1251 CAAGACCCCG AGCGGCAAGA AGCTGGTCGA CCTGGTGGGC TTCATGCCGC 
1301 ACACCACCAC TGCCCAGGCC GAGGGCATCT GCTGGTAGCG G 
PA 8830 algD protein SEQ ID NO: 134 

1 MRISIFGLGY VGAVCAGCLS ARGHEVIGVD VSSTKIDLIN QGKSPIVEPG 
51 LEALLQQGRQ TGRLSGTTDF KKAVLDSDVS FICVGTPSKK NGDLDLGYIE 
101 TVCREIGFAI REKSERHTVV VRSTVLPGTV NNVVIPLIED CSGKKAGVDF 
151 GVGTNPEFLR ESTAIKDYDF PPMTVIGELD KQTGDLLEEI YRELDAPIIR 
201 KTVEVAEMIK YTCNVWHAAK VTFANEIGNI AKAVGVDGRE VMDVICQDHK 
251 LNLSRYYMRP GFAFGGSCLP KDVRALTYRA SQLDVEHPML GSLMRSNSNQ 
301 VQKAFDLITS HDTRKVGLLG LSFKAGTDDL RESPLVELAE MLIGKGYEFR 
351 IFDRNVEYAR VHGANKEYIE SKIPHVSSLL VSDLDEVVAS SDVLVLGNGD 
401 ELFVDLVNKT PSGKKLVDLV GFMPHTTTAQ AEGICW 
>Contig1126 of Mutant 25A12 SEQ ID NO: 135 

. . AACACCGGACGCGCCCCGATCATGTGCGCTGAGCGCTACGCTACCGTCAA 

CGAAAAAGGCCACCTCGGGGTGGCCTTTTCGCGTTCTCGCACCGATCGCG 

CGGAATATCGGCGGTTAACGCCTCTCCCCCGTGCGCACCTGCGGCTGAGC 

CTCAGAACGAAGTCCGGCGGTAGGCACGGTAGCGCGGGAACCAGAAGTTC 

GCCTCGATGGCGTCGTTCAGTACCTCGTCGCTGGTATGCAGGGCCTTGCC 

CTCGGCCTGGGCCTGCTTGGCCACGGCGACGGCGATGCGCTTGCTGACCT 

CGCGGATGTCGCCCAGCGCCGGCAACACGGCGCCCTCGCCCTGGGTAACG 

ATCGGCGAGCAGTTGGCCAGGGCGTTGGCCGCGGCCATCAGCATGCCTTC 

GGTGACCCGATTGGCCCGCGCGGCGATCACCCCCAGGCCGATGCCGGGGA 

AGATATAGGCGTTGTTGCACTGGGCGATGGGAATCCGCTTGTCGCCCACC 

TGCACCGGTTGGAACGGGCTACCGGTGGCGACCAGCGCCTGGCCGTCGGT 

CCAGTTGAGGATTTCCTGCGGAGTCGCCTCGACCCGCGAGGTCGGGTTGG 

ACAGCGGCATCACCAGCGGCTGCTTGCAATGGCTGTGCAGCTCACGGATG 

ACCTCTTCGGAAAACAGCCCGCGCTGCCCGGAGACGCCGATCAGCACCGT 

CGGCCGGGCATTGCGGATCACTTCCAGCAACGCCAGGTCGTCGCCCTGCT 

GGCCGCCCCAGGCACCGAGATCGGCGCGCTTCTGCGCCAGGCGGTGCTGG 

AAGTCGACCAGGTTGCTCATGTCGTCGGTGAGCAGGCCCCAGCGGTCGAC 

CATGAAGATGCGCCGACGCGCCTGGGCCTCGTCCAGGCCCTCCAGTTGCA 

TGGCGGCGATGATCTGTTCGGCGATGCCGCAACCGGCGGAGGGGCGCCGA 

CGAAGGTCACGGTCTGCTCGCTGAGCTTCTCGCCCTTGGCCTTGCAAGCC 

GCCAGCAGGGTGCCCACGGCCACCGCGGCGGTGCCCTGGATGTCGTCGTT 

GAAGCAGCACAGCTCGTCCTTGTAGCGCTCCAGCAACGGCATGGCATTGG 

TCTGGGCGAAGTCCTCGAATTGCAGCAGGACGTTGGGCCAGCGGCGCTTG 

ATCGCCTGGATGAACAGGTCGACGAACTCCTCGTACTGCGCCCCGCTCAC 

CCGCTCGTGGCGCCACCCAATGTACATCGGGTCGTTGAGCAGGTCCGGGT 

TGTTGGTGCCGACGTCCAGCACCACCGGCAGGGTGTAGGCCGGGCTGATA 

CCGCCGCAGGTGTAACAGGGGACAGCTTGCCGATCGGGATGCCCATCCGG 

CCGATGCCCTGGTTGCCGAGGGCGAGGATCGGCTGGCTGTCGGTACAAAA 

CAATCTAAGGTGTCTTTGGTGGCTTGAAGGAGTTTCAATCGTTCGGGCCG 

GGAAGAATAAAGGCCCGGTGGGTCGAAACTTTGAATCTGGAAGGTTGCAA 

ACTGGGGGAAAAAATGGAAATTTTTAAGAGCCTAAGAGCGGAAAAAAGTT 

CTTTTTCTAAAAAGAAAAAAATGGGGAAAAAGTTGAAAAGTATATGATAA 

GAGCAGGTGTCAAAATGAATGTTTTGAAAGCCCAGTGAAATAAACTCTGG 

AAAAGGCAGTTATAAGGGCTATAAAAGGGATGAAAAAAGAAGTGTGTGAA 

ATAACGAAAGGCAATAGGGAAAA 
Sequence: 33C7 contig From: 1 To: 2048 

10 20 30 40 50 60 

AGCTTATGCA TGCGGCCGCA TCTAGAGGGC CCGGATCCGG TGACCATCGG TCACCGGCAT 60 
GCCGGTGGTT TCGGTATCCA GTACGACGCT ACGCATCTAT AGAGCCTTTC TCTGTTTCGC 120 
TGCAGCCGTG GCTGCTGAAC GCTTGTTTCG GTGTGGCCGC TCAGCGCGGC AATTCGGCGA 180 
CGCCACGGTT GGCCAACTGG TCGGCCCGCT CGTTGCCGGG GTCGCCGGTA TGCCCGCGGA 240 
CCCACTGCCA CTCCACCTGG TGCCGGGCGA CCTGTTCATC CAGGGCCTGC CAGAGGTCGG 300 

310 320 330 340 350 360 

CATTCTTGAC AGGCTGCTTG CTGGCGGTCT TCCAGCCGCG CTTCTTCCAG TTCGGCAACC 360 

ATTCGGTGAT GCCGCGCATC ACGTATTCCG AGTCGGTGAT CAGACGGATC GGACAGGAAC 420 

GCTTGAGTGC CGCCAGCGCC TGGATCGCCG CCATCAGCTC CATGCGGTTG TTGGTGGTGT 480 

CCGGCTCGCC GCCCCAAAGC TCTCGCTCGG CGCCCTTGTA GAGGAGCAAC GCCCCCCAGC 540 

CGCCGCGCCC AGGGTTGCCC TTGCAGGCGC CGTCGGTATA GATCACTACC TGTTCTTTAT 600 

610 620 630 640 650 660 

CTGTCATGCC TAAATTTCGG AATCTCGCCG GCTGACTTTC GCCACCGGCA TGGGCACCAG 660 
CTGACCGCGC GGTTCGCGCT TGCTCTGGCG CAACGGGCGC AACCCCACGA CCAGCTTGCG 720 
TGCCACCAAT AGATAGAAGC CGGCGCCCGA AGACTGCCAG GCGTCGCCCC AGCGCTCCAG 780 
GCGAGCCAGG CGCGATTGCC AGGCTGCCGA CGCAAGCGGC GGACGATAGC ACCCGAAGCG 840 
CCGTTTCTCC AGCGCGAAGC CCAGCAGGTT GAGCCAATCG CAGGCCCGCG ACGGAGGAAT 900 
910 920 930 940 950 960 

GCAGCGGGCC TGGCGCAAGG CATCCCCGGC GAAATAATGA CGGATGCCCC ACAGGCTCCA 960 
TGGGTTGATG CCGATCAGCA GCAGGTGGCC GCCCGGACGA ACGGTACGCG CGGCTTCGCG 1020 
CAGGAGACGG TGAGGCGACA GGCAGAAATC CAGGCCGTGT TGCAGCAGGA CCACGTCCGC 1080 
GGCATGTTCG CTGAGCGGCC AGGCGCCCTC TTCGCAGGCG ATGTCCACGC CCGGCAGCGG 1140 
CGGCCCCAGG CGCACGCCGC GCTGAATCTG CCCGGTGCTC GGCGGCAGTT CGGCATGCGG 1200 
1210 1220 1230 1240 1250 1260 

CCCGTAGTGC ACCAGGTAGC CACCGAAGTA ACGGGTCAGC TCGTCGCACA ACAGGCGTCG 1260 

CTCCTCGGCC AGCATCAGGC TGCCCAGCGG GCCCTGGAAC CAGTCGCGCG CCCGGTTGAT 1320 

CGATGCCAGC CACTCGGCAT CGGTCTGGGC GAAGGCTTGC GGTTCGTTCA TGCGTACCTC 1380 

CAGCGTCTTC CCCTTCGCGG CGACGGACGC CGGCACGACG GGAAAATAAG CAATACTATG 1440 

CGCCAATGAC TTCTGCTTAG CGACATCGAC CCATGATACA GATCGACGCC CTGCCCGCCT 1500 
Sequence: 33C7 contig From: 1 To: 2048 (continued) 

1510 1520 1530 1540 1550 1560 

TCAACGACAA CTACATCTGG CTGTTGCAAG ATGCGACAAG CCGTCGCTGC GCGGTGGTCG 1560 

ACCCCGGCGA TGCCAAGCCG GTGGAAGCCT GGCTGGCCGC CCATCCCGAC TGGCGGTTGA 1620 

GCGATATCCT GGTGACCCAC CACCATCACG ACCACGTCGG CGGCGTCGCG GCCCTGAAGG 1680 

AACTGACCGG CGCGCGGGTT CTCGGCCCGG CCAACGAGAA GATCCCGGCC CGCGACCTGG 1740 

CGCTGGAAGA CGGCGAACGG GTCGAGGTGC TCGGCCTGGT CTTCGAGATC TTCCACGTGC 1800 

1810 1820 1830 1840 1850 1860 

CCGGCCATAC CCTCGGCCAT ATCGCCTACT ACCACCCGGC GGAGACGCCG CTGCTGTTCT 1860 

GCGGCGACAC CCTGTTCGCC GCCGGCTGCG GCCGTCTCTT CGAAGGCACC CCGGCGCAGA 1920 

TGCACCATTC CCTGGCGCGA CTGGCCGCGC TGCCGGCCAA CACCCGGGTC TACTGCACCC 1980 

ACGAGTACAC GCTGAGCAAC CTGCGCTTCG CGCTGGCGGT GGAGCCCGAC AACGCGGCGC 2040 
TGCGGGAA 2048 

Fig. 32C 



33C7 ORF A 

ATGAACGAAC CGCAAGCCTT CGCCCAGACC GATGCCGAGT 40 
GGCTGGCATC GATCAACCGG GCGCGCGACT GGTTCCAGGG 80 
CCCGCTGGGC AGCCTGATGC TGGCCGAGGA GCGACGCCTG 120 
TTGTGCGACG AGCTGACCCG TTACTTCGGT GGCTACCTGG 160 
TGCACTACGG GCCGCATGCC GAACTGCCGC CGAGCACCGG 200 



GCAGATTCAG CGCGGCGTGC GCCTGGGGCC GCCGCTGCCG 240 
GGCGTGGACA TCGCCTGCGA AGAGGGCGCC TGGCCGCTCA 280 
GCGAACATGC CGCGGACGTG GTCCTGCTGC AACACGGCCT 320 
GGATTTCTGC CTGTCGCCTC ACCGTCTCCT GCGCGAAGCC 360 
GCGCGTACCG TTCGTCCGGG CGGCCACCTG CTGCTGATCG 400 



GCATCAACCC ATGGAGCCTG TGGGGCATCC GTCATTATTT 440 
CGCCGGGGAT GCCTTGCGCC AGGCCCGCTG CATTCCTCCG 480 
TCGCGGGCCT GCGATTGGCT CAACCTGCTG GGCTTCGCGC 520 
TGGAGAAACG GCGCTTCGGG TGCTATCGTC CGCCGCTTGC 560 
GTCGGCAGCC TGGCAATCGC GCCTGGCTCG CCTGGAGCGC 600 



TGGGGCGACG CCTGGCAGTC TTCGGGCGCC GGCTTCTATC 640 
TATTGGTGGC ACGCAAGCTG GTCGTGGGGT TGCGCCCGTT 680 
GCGCCAGAGC AAGCGCGAAC CGCGCGGTCA GCTGGTGCCC 720 
ATGCCGGTGG CGAAAGTCAG CCGGCGAGAT TCCGAAATTT 760 
AG 762 
Sequence: 33C7 ORF A From: 1 To: 254 

10 20 30 40 

MNEPQAFAQT DAEWLASINR ARDWFQGPLG SLMLAEERRL 40 
LCDELTRYFG GYLVHYGPHA ELPPSTGQIQ RGVRLGPPLP 80 
GVDIACEEGA WPLSEHAADV VLLQHGLDFC LSPHRLLREA 120 
ARTVRPGGHL LLIGINPWSL WGIRHYFAGD ALRQARCIPP 160 
SRACDWLNLL GFALEKRRFG CYRPPLASAA WQSRLARLER 200 

WGDAWQSSGA GFYLLVARKL VVGLRPLRQS KREPRGQLVP 240 
MPVAKVSRRD SEI. 254 

Fig. 32E 

Sequence: 33C7 ORF B From: 1 To: 801 

10 20 30 40 50 60 

ATGGAGCCTG TGGGGCATCC GTCATTATTT CGCCGGGGAT GCCTTGCGCC AGGCCCGCTG 60 
CATTCCTCCG TCGCGGGCCT GCGATTGGCT CAACCTGCTG GGCTTCGCGC TGGAGAAACG 120 
GCGCTTCGGG TGCTATCGTC CGCCGCTTGC GTCGGCAGCC TGGCAATCGC GCCTGGCTCG 180 
CCTGGAGCGC TGGGGCGACG CCTGGCAGTC TTCGGGCGCC GGCTTCTATC TATTGGTGGC 240 
ACGCAAGCTG GTCGTGGGGT TGCGCCCGTT GCGCCAGAGC AAGCGCGAAC CGCGCGGTCA 300 
310 320 330 340 350 360 

GCTGGTGCCC ATGCCGGTGG CGAAAGTCAG CCGGCGAGAT TCCGAAATTT AGGCATGACA 360 
GATAAAGAAC AGGTAGTGAT CTATACCGAC GGCGCCTGCA AGGGCAACCC TGGGCGCGGC 420 
GGCTGGGGGG CGTTGCTCCT CTACAAGGGC GCCGAGCGAG AGCTTTGGGG CGGCGAGCCG 480 
GACACCACCA ACAACCGCAT GGAGCTGATG GCGGCGATCC AGGCGCTGGC GGCACTCAAG 540 
CGTTCCTGTC CGATCCGTCT GATCACCGAC TCGGAATACG TGATGCGCGG CATCACCGAA 600 

610 620 630 640 650 660 

TGGTTGCCGA ACTGGAAGAA GCGCGGCTGG AAGACCGCCA GCAAGCAGCC TGTCAAGAAT 660 
GCCGACCTCT GGCAGGCCCT GGATGAACAG GTCGCCCGGC ACCAGGTGGA GTGGCAGTGG 720 
GTCCGCGGGC ATACCGGCGA CCCCGGCAAC GAGCGGGCCG ACCAGTTGGC CAACCGTGGC 780 
GTCGCCGAAT TGCCGCGCTG A 801 

Fig. 32F 

Sequence: 33C7 ORF B PROTEIN From: 1 To: 267 

10 20 30 40 50 

MEPVGHPSLF RRGCLAPGPL HSSVAGLRLA QPAGLRAGET ALRVLSSAAC 50 
VGSLAIAPGS PGALGRRLAV FGRRLLSIGG TQAGRGVAPV APEQARTARS 100 
AGAHAGGESQ PARFRNLGMT DKEQVVIYTD GACKGNPGRG GWGALLLYKG 150 
AERELWGGEP DTTNNRMELM AAIQALAALK RSCPIRLITD SEYVMRGITE 200 
WLPNWKKRGW KTASKQPVKN ADLWQALDEQ VARHQVEWQW VRGHTGDPGN 250 
260 270 280 290 300 

ERADQLANRG VAELPR. 267 

Fig. 32G 
33C7 ORF C 

10 20 30 40 50 60 

ATGACGGATG CCCCACAGGC TCCATGGGTT GATGCCGATC AGCAGCAGGT GGCCGCCCGG 60 
ACGAACGGTA CGCGCGGCTT CGCGCAGGAG ACGGTGAGGC GACAGGCAGA AATCCAGGCC 120 
GTGTTGCAGC AGGACCACGT CCGCGGCATG TTCGCTGAGC GGCCAGGCGC CCTCTTCGCA 180 
GGCGATGTCC ACGCCCGGCA GCGGCGGCCC CAGGCGCACG CCGCGCTGAA TCTGCCCGGT 240 
GCTCGGCGGC AGTTCGGCAT GCGGCCCGTA GTGCACCAGG TAGCCACCGA AGTAACGGGT 300 
310 320 330 340 350 360 

CAGCTCGTCG CACAACAGGC GTCGCTCCTC GGCCAGCATC AGGCTGCCCA GCGGGCCCTG 360 
GAACCAGTCG CGCGCCCGGT TGATCGATGC CAGCCACTCG GCATCGGTCT GGGCGAAGGC 420 
TTGCGGTTCG TTCATGCGTA CCTCCAGCGT CTTCCCCTTC GCGGCGACGG ACGCCGGCAC 480 
GACGGGAAAA TAAGCAATAC TATGCGCCAA TGA 513 

Fig. 32H 



Sequence: 33C7 ORF C PROTEIN From: 1 To: 171 

10 20 30 40 50 

MTDAPQAPWV DADQQQVAAR TNGTRGFAQE TVRRQAEIQA VLQQDHVRGM 50 
FAERPGALFA GDVHARQRRP QAHAALNLPG ARRQFGMRPV VHQVATEVTG 100 
QLVAQQASLL GQHQAAQRAL EPVARPVDRC QPLGIGLGEG LRFVHAYLQR 150 
LPLRGDGRRH DGKISNTMRQ . 171 



Fig. 32I 
1G2 SEQ ID NO:137 

1 NTTGTGTTAA GATCAGGCTT GGTGGTGAAG AAAGGTTCGA ACNNGTGGTC 

51 AATGATCNAC TTCGGGGATN CNGCTGCCCG TATNATTCAA CACGTGGTCA 

101 AACGGTATGT TCCGAGGCGT CTGNCCACCN GTACTAGTCG ACGC 



Fig. 33 
Fig. 34C 